ABSTRACT
structure-mapping theory. This approach presents analogy elaboration
focused on correspondence inadequacies. The second issue is the
evaluation of the quality of a proposed analogy and its use for some
performance task. A theory of verification-based analogical learning
is presented to address the tenuous nature of analogically inferred
concepts, and procedures for increasing confidence in the inferred
knowledge are described. Specifically, it relies on analogical
inference to hypothesize new theories and simulation of those
theories to analyze their validity. It represents a view of analogy
as an iterative process of hypothesis formation, testing, and
revision. These ideas are illustrated via PHINEAS, a computer program
that uses similarity to posit qualitative explanations for
time-varying descriptions of physical behaviors. It builds upon
existing work in qualitative physics to provide a means with which to
describe and reason with theories of the physical world. A 144-item
list of references is included. (TJH)






DEPARTMENT OF COMPUTER SCIENCE

UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN




REPORT NO. UIUCDCS-R-88-1479



UILU-ENG-88-1785 






LEARNING FROM PHYSICAL ANALOGIES: 
A STUDY IN ANALOGY AND THE EXPLANATION PROCESS 



December 1988 



Brian Falkenhainer






m TMiaiac B«nreh Iropaau, Coalnet No. N0a014-«5-K.-085«. 

Reproduction in whole or part is permitted for any purpose of the
United States Government.
Approved for public release; distribution unlimited.



SECURITY CLASSIFICATION OF THIS PAGE

REPORT DOCUMENTATION PAGE

1a. REPORT SECURITY CLASSIFICATION: Unclassified
1b. RESTRICTIVE MARKINGS:
2a. SECURITY CLASSIFICATION AUTHORITY:
2b. DECLASSIFICATION/DOWNGRADING SCHEDULE:
3. DISTRIBUTION/AVAILABILITY OF REPORT: Approved for public release; distribution unlimited
4. PERFORMING ORGANIZATION REPORT NUMBER(S): UIUCDCS-R-88-1479
5. MONITORING ORGANIZATION REPORT NUMBER(S):
6a. NAME OF PERFORMING ORGANIZATION: University of Illinois, Dept. of Computer Science
6b. OFFICE SYMBOL (If applicable):
6c. ADDRESS (City, State, and ZIP Code): 304 W. Springfield Ave., Urbana, IL 61801
7a. NAME OF MONITORING ORGANIZATION: Office of Naval Research, Cognitive Science Division (Code 1142CS)
7b. ADDRESS (City, State, and ZIP Code): 800 N. Quincy St., Arlington, VA 22217-5000
8a. NAME OF FUNDING/SPONSORING ORGANIZATION:
8b. OFFICE SYMBOL (If applicable):
8c. ADDRESS (City, State, and ZIP Code):
9. PROCUREMENT INSTRUMENT IDENTIFICATION NUMBER: N00014-89-J-1272
10. SOURCE OF FUNDING NUMBERS:
    PROGRAM ELEMENT NO.: 1153N
    PROJECT NO.:
    TASK NO.:
    WORK UNIT ACCESSION NO.: NR 442f-007l
11. TITLE (Include Security Classification): Learning From Physical Analogies: A Study in Analogy and the Explanation Process
    (Approved for public release; distribution unlimited)
12. PERSONAL AUTHOR(S):
13a. TYPE OF REPORT:
13b. TIME COVERED: FROM      TO
14. DATE OF REPORT (Year, Month, Day): 12-27-88
15. PAGE COUNT:
16. SUPPLEMENTARY NOTATION:
17. COSATI CODES (FIELD, GROUP, SUB-GROUP): 10
18. SUBJECT TERMS (Continue on reverse if necessary and identify by block number):
    Structure-mapping theory; contextual structure-mapping;
    verification-based analogical learning; PHINEAS
19. ABSTRACT (Continue on reverse if necessary and identify by block number)

To make programs that understand and interact with the world as well as people do, we must
duplicate the kind of flexibility people exhibit when conjecturing plausible explanations
of the diverse physical phenomena they encounter. This process often involves drawing upon
physical analogies - viewing the situation and its behavior as similar to familiar phenomena,
conjecturing that they share analogous underlying causes, and using the plausible interpre-
tation as a foothold to further understanding, analysis, and hypothesis refinement.

This report investigates analogical reasoning and learning applied to the task of construct-
ing qualitative explanations for observed physical phenomena. Primary emphasis is placed on
two central questions. First, how are analogies elaborated to sanction new inferences about
a novel situation? This problem is addressed by contextual structure-mapping, a knowledge-
intensive adaptation of Gentner's structure-mapping theory. It presents analogy elaboration
as a map and analyze cycle, in which two situations are placed in correspondence, followed
by problem solving and inference production focused on correspondence inadequacies. Second,
how is the quality of a proposed analogy evaluated and used for some performance
task? A theory of verification-based analogical learning is presented which
addresses the tenuous nature of analogically inferred concepts and describes
procedures that can be used to increase confidence in the inferred knowledge.
Specifically, it relies on analogical inference to hypothesize new theories and
simulation of those theories to analyze their validity. It represents a view
of analogy as an iterative process of hypothesis formation, testing, and
revision.

These ideas are illustrated via PHINEAS, a program which uses similarity to posit
qualitative explanations for time-varying descriptions of physical behaviors.
It builds upon existing work in qualitative physics to provide a rich environment
in which to describe and reason with theories of the physical world.

20. DISTRIBUTION / AVAILABILITY OF ABSTRACT: UNCLASSIFIED/UNLIMITED □ SAME AS RPT □ DTIC USERS
21. ABSTRACT SECURITY CLASSIFICATION: Unclassified
22a. NAME OF RESPONSIBLE INDIVIDUAL:
22b. TELEPHONE (Include Area Code): -4318
22c. OFFICE SYMBOL: ONR 1142CS

DD FORM 1473, 84 MAR. 83 APR edition may be used until exhausted.
All other editions are obsolete.

SECURITY CLASSIFICATION OF THIS PAGE



©Copyright by
Brian Carl Falkenhainer
1989






LEARNING FROM PHYSICAL ANALOGIES:
A STUDY IN ANALOGY AND THE EXPLANATION PROCESS



BY 



BRIAN CARL FALKENHAINER

B.S., Santa Clara University, 1982
M.S., University of Illinois, 1985



THESIS 

Submitted in partial fulfillment of the requirements
for the degree of Doctor of Philosophy in Computer Science
in the Graduate College of the
University of Illinois at Urbana-Champaign, 1989



Urbana, Illinois 



LEARNING FROM PHYSICAL ANALOGIES:
A STUDY IN ANALOGY AND THE EXPLANATION PROCESS 

Brian Carl Falkenhainer, Ph.D.
Department of Computer Science
University of Illinois at Urbana-Champaign, 1989
Kenneth D. Forbus, Advisor



To make programs that understand and interact with the world as well as people do, 
we must duplicate the kind of flexibility people exhibit when conjecturing plausible ex- 
planations of the diverse physical phenomena they encounter. This process often involves 
drawing upon physical analogies - viewing the situation and its behavior as similar to fa-
miliar phenomena, conjecturing that they share analogous underlying causes, and using the
plausible interpretation as a foothold to further understanding, analysis, and hypothesis 
refinement. 

This thesis investigates analogical reasoning and learning applied to the task of con-
structing qualitative explanations for observed physical phenomena. Primary emphasis is
placed on two central questions. First, how are analogies elaborated to sanction new infer- 
ences about a novel situation? This problem is addressed by contextual structure-mapping, a 
knowledge-intensive adaptation of Gentner's structure-mapping theory. It presents analogy
elaboration as a map and analyze cycle, in which two situations are placed in correspon- 
dence, followed by problem solving and inference production focused on correspondence 
inadequacies. Second, how is the quality of a proposed analogy evaluated and used for 
some performance task? A theory of verification-based analogical learning is presented
which addresses the tenuous nature of analogically inferred concepts and describes pro-
cedures that can be used to increase confidence in the inferred knowledge. Specifically, it
relies on analogical inference to hypothesize new theories and simulation of those theories to 
analyze their validity. It represents a view of analogy as an iterative process of hypothesis 
formation, testing, and revision. 

These ideas are illustrated via PHINEAS, a program which uses similarity to posit qual-
itative explanations for time-varying descriptions of physical behaviors. It builds upon
existing work in qualitative physics to provide a rich environment in which to describe and
reason with theories of the physical world.

Chapter 1

Learning from Physical Analogies 



We cannot, coming into something new, deal with it except on the basis of the
familiar and the old-fashioned... We cannot learn to be surprised or astonished at
something unless we have a view of how it ought to be; and that view is almost
certainly an analogy. We cannot learn that we have made a mistake unless we can
make a mistake; and our mistake is almost always in the form of an analogy to some
other piece of experience. (Oppenheimer, 1956, pg. 129-130)

To make programs that understand and interact with the world as well as people do,
we must duplicate the kind of flexibility people exhibit when conjecturing plausible expla-
nations of the diverse physical phenomena they encounter. I view this flexibility as arising
from an ability to detect similarities, within and across domains, between the various phe-
nomena. Interpreting an observation often requires the flexible integration of knowledge
from multiple sources and the formation of new theories about the world. For example,
suppose some unusual behavior is observed. What is causing it? In general, the process
of constructing a satisfactory explanation will involve drawing upon physical analogies -
viewing the situation and its behavior as similar to familiar phenomena, conjecturing that
they share analogous underlying causes, and using the plausible interpretation as a foothold
to further understanding, analysis, and hypothesis refinement.

The goal of this work is to develop a system that can offer plausible answers to the
types of questions shown in Figure 1.1 by relating these unfamiliar situations to more
familiar concepts. The first example, a hot brick immersed in cold water, is taken from
the classic fluid flow - heat flow analogy that led to the development of the caloric theory
of heat. The second example, a beach ball suspended in a jet of air, demonstrates how
approximate hypotheses proposed through similarity may be discriminated by envisioning
their consequences. The third example demonstrates how the same analogy approach may
be used to provide answers to conventional abductive inference situations, in this case by
finding that liquid flow is the best analogue to the given liquid flow scenario.

• What causes a hot brick and cold water to change to the same median temperature
when the brick is immersed in the water?

This situation may be explained if it is assumed analogous to liquid flowing from one container
to another. The assumption is supported by the brick's temperature and the water's temperature
asymptotically approaching equality. It requires further assumptions that the brick and water
contain hypothesized substance sk-water-6 and that the brick and water temperatures are pro-
portional to the amount of sk-water-6 they contain. Then, when the brick and water are placed
in contact, their difference in temperatures causes the transfer of sk-water-6 from the brick to the
water. This explanation consistently predicts the observed behavior.

[Diagram: a beachball suspended in an air column]

• A beachball may be suspended in a vertical column of flowing air, as in a vacuum
cleaner set in reverse. What holds the ball in place and why is it so stable?

Two explanations may be proposed. In one, the air is pushing on either side of the ball, holding
it in place. In the other, an air pressure difference is pulling on either side of the ball, holding it
in place. However, the first explanation fails to accurately explain the situation, since offsetting
the ball to the right results in increased force towards the right, which is not stable. The second
explanation consistently predicts the observed behavior, since offsetting the ball to the right
results in increased force to the left, which is stable.

• A beaker and a vial, each containing water, are connected by objects. What is causing
the water in the beaker to decrease while the water in the vial is increasing?

This situation appears to be an instance of liquid flow if it is assumed that objects is a fluid
path. Then, the beaker pressure being greater than the vial pressure would cause the water to
flow from the beaker to the vial through objects, until their pressures are equal. This explanation
consistently predicts the observed behavior.

Figure 1.1: The types of questions this thesis is designed to answer.

This chapter begins by suggesting that explanation and analogy share a common core, 
in which the search for explanatory similarity is the driving force behind all forms of expla- 
nation. Crucial to such a unified view is a powerful model of analogy, which is summarized 
in Section 1.2. Section 1.3 then defines a special case of general explanatory analogies, 
physical analogies. Section 1.4 introduces PHINEAS, a program which uses physical analo- 
gies to provide plausible explanations of observed physical phenomena and enhance its 
understanding of the physical world. Finally, Section 1.5 provides a reader's guide to the 
subsequent chapters of this thesis. 

1.1 Explanation 

An important aspect of understanding and interacting with the physical world is proposing 
causal explanations for what we see around us. When existing knowledge is sufficient for
explanation, this is commonly called the interpretation or diagnosis task: select the theory 
that provides the best account of the phenomenon. When knowledge is lacking, the task 
becomes one of theory formation: conjecture a new or revised theory that will account 
for the phenomenon. Typically, explanation, interpretation, and diagnosis are decoupled 
from theory formation in AI. Yet they are intimately related, representing different levels of 
certainty within a common process. Interpretation typically involves making assumptions
due to incomplete knowledge about the situation. Theory formation typically involves
making assumptions about both the situation and the incompleteness of current theories. 

Abduction is the process that generates explanations. It is a form of inference that fits 
the following pattern (Josephson et al., 1987):^1

D is a collection of data (facts, observations, givens);

H explains D (would, if true, imply D); 

no other hypothesis explains D as well as H does; 

therefore, H is correct. 

For example, if we see that the grass is wet, and we know that rain makes the grass
wet, we might hypothesize that it had rained recently. If the sky was cloudy, this would
lend credence to our hypothesis. However, this is a guess. The sprinklers might have been
on recently (Pearl, 1987).

^1 There is a possible distinction between the abduction process, normally associated with backchaining,
and its ultimate product, a deductive proof tree normally having some assumptions at the leaves. Through-
out this chapter, we are primarily interested in the abstract product, independent of backchaining, in which
assumption of some unknown antecedent facts is required to complete the explanation.

Figure 1.2: Two extremes of abduction: (a) simple backward chaining on an atomic goal
and (b) best match relating explanation patterns to observation patterns.

Abduction is inference to the best explanation, that is, if the hypothesis were true, it 
would explain the phenomenon. There are two key phrases here. "If it were true" indicates 
that not all of the relevant knowledge may be known and assumptions may be required 
to fill in the gaps. The process of finding candidates and the assumptions that must be 
made along the way will be called the interpretation-construction task. "It would explain 
the phenomenon" indicates that the hypothesis would explain the phenomenon, not that 
it is the correct explanation. There may be other hypotheses that would explain it as 
well. The process of deciding which hypothesis is the best explanation will be called the 
interpretation-selection task. 
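
The two tasks can be sketched in a few lines of Python. This is only an illustrative toy built on the wet-grass example, not the mechanism developed in this thesis: the `RULES` table, the function names `construct_interpretations` and `select_interpretation`, and the fewest-assumptions preference are all assumptions introduced here for illustration.

```python
# Toy abduction over the wet-grass example. Every name and rule below is
# hypothetical; the selection criterion (fewest assumptions) is one simple
# choice among many possible ones.
RULES = {
    "rain": {"needs": {"cloudy"}, "explains": "wet(grass)"},
    "sprinkler": {"needs": {"sprinkler_on"}, "explains": "wet(grass)"},
}


def construct_interpretations(observation, known_true, known_false):
    """Interpretation construction: collect candidate hypotheses together
    with the assumptions each one needs in order to explain the observation."""
    candidates = []
    for name, rule in RULES.items():
        if rule["explains"] != observation:
            continue
        if rule["needs"] & known_false:
            continue  # contradicted by what is already known
        assumptions = rule["needs"] - known_true
        candidates.append((name, assumptions))
    return candidates


def select_interpretation(candidates):
    """Interpretation selection: prefer the candidate with the fewest assumptions."""
    return min(candidates, key=lambda c: len(c[1]), default=None)


candidates = construct_interpretations("wet(grass)", {"cloudy"}, set())
best = select_interpretation(candidates)
print(best)  # ('rain', set())
```

Both hypotheses would explain the wet grass; the cloudy sky only lends credence to the rain hypothesis by removing the need for an assumption, so the sprinkler remains a live alternative.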

Analogy and abduction share both interpretation tasks. They must each propose a 
plausible set of candidates that fit the current situation and they must each select from 
those candidates the one(s) that will be accepted as the final interpretation. The model of 
explanation developed in this thesis is based on the view that analogy suffices as the central 
process model for explanation tasks. There are two arguments supporting this view. 

First, consider the basic abduction process. The backchaining model of abduction 
works well for explaining simple atomic occurrences (Figure 1.2(a)), such as Wet (grass). 
However, as the complexity of the phenomenon being explained increases, our ability to 
simply backchain to a small set of plausible causes diminishes. The entirety of the situation 
must be considered and all of the interrelations between aspects taken into account (Fig- 
ure 1.2(b)). In a complex, multi-faceted phenomenon, meaning arises out of consideration 
of the whole. Hence, most multi-faceted explanation systems are based on some form of
macro-matching, typically in terms of schemas or frames, that seeks minimal hypothesis
sets maximally fitting the data. This is true of script-based models of story understand-
ing (e.g., Charniak, 1972; DeJong, 1982), process models for interpreting the behavior of
a physical system (e.g., Forbus, 1986a), and composite matching models of abduction and
diagnosis (e.g., Reggia, 1983; Josephson et al., 1987). The desire for a minimal hypothesis,
best match is further implicitly reflected in the Occam's razor heuristic found in simpler
models, which backtrack on one piece of data at a time (e.g., Pople, 1973). In other words,
interpretation and explanation are a form of best match process, with the goal of matching
the current situation to that which could explain it. Adaptability can be achieved without
loss of function by generalizing the identicality test of unification-based models to the more
general, constrained similarity match of analogy.

Figure 1.3: Four alternative explanation scenarios: (a) deduction scenario, (b) assumption
scenario, (c) generalization scenario, (d) analogy scenario. Legend: Known, Assumed,
False, Analogical Assumption.

Second, consider the following explanation scenarios (Figure 1.3): 

Deduction scenario: Given phenomenon V, where V represents a set of observables,
a complete explanation of V deductively follows from existing knowledge. The only open 
question is if it is the explanation, as there may be others. For example, suppose fluid 
flow is observed and all of the preconditions for fluid flow are known to hold (e.g., the 
source pressure is greater than the destination pressure, the fluid path is open, etc.). Then 
a fluid flow explanation directly follows. Given the observed behavior and the existing 
preconditions, we could say that the situation is trivially analogous, or literally similar to 
liquid flow. 

Assumption scenario: Phenomenon V, where V represents a set of observables, is given.
No explanation can be grounded with current knowledge because not all of the relevant facts
are known. However, a complete explanation follows from the union of existing knowledge
and a consistent set of assumptions about the missing facts. For example, if liquid flow is
observed but we don't know if the fluid path valve is open or closed, we may assume the
valve is open if there is no evidence to the contrary.



Generalization scenario: Phenomenon V, where V represents a set of observables, is
given. Existing knowledge indicates that candidate explanation B cannot apply because
condition Ci is known to be false in the current situation. However, B does follow if condi-
tion Ci is replaced by the next most general relation, since Ci's sibling is true in the current
situation. This is a standard knowledge base refinement scenario (e.g., Smith et al., 1985).

Analogy scenario: Phenomenon V, where V represents a set of observables, is given.
No candidate explanation B is available directly, but an explanation is available if a series
of analogical assumptions are made, that is, if the situation it explains is assumed
analogous to the current situation. For example, if heat flow is observed, but little is
known about heat phenomena, then an explanation may be constructed by analogy to
liquid flow.

Each scenario must perform the interpretation-construction task: retrieve from mem-
ory explanatory hypotheses matching the current situation. Each must perform the
interpretation-selection task: select from a set of candidate hypotheses that which is most
probable, plausible, coherent, etc. If the best we can do is distant analogy, then that is
the best we can do. Why should abductive assumptions be limited to assumptions over
the boolean truth values of unknown facts? Abduction should be broadened to include
assumptions about the validity of a generalization or the validity of an analogy relation.
Is osmosis a generalization of standard liquid flow, or is it merely analogous? What about
a siphon? At what point does something stop being a within-domain, literal similarity
comparison and become an across-domain analogy comparison? Ideally, an explanation
facility should be able to say "the observed phenomenon T is extremely like phenomenon
V in the relevant relations", even if some required condition is believed absent. Consider
a chemical reaction R needing catalyst C. If the effects of R are seen, yet the catalyst is
not present, we should still be reminded of that chemical reaction if no better hypothesis
exists.
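
One way to read the four scenarios is as a single test applied to a candidate explanation, with the outcome depending only on how well existing knowledge supports its preconditions. The sketch below is a hedged illustration of that reading, not a procedure taken from this thesis: `classify_scenario`, the precondition names, and the `siblings` table (conditions that share the next most general relation) are all hypothetical.

```python
def classify_scenario(preconditions, known_true, known_false, siblings):
    """Label how well existing knowledge supports a candidate explanation.

    preconditions: set of conditions the explanation requires
    known_true / known_false: sets of facts about the current situation
    siblings: hypothetical map from a condition to the conditions that share
              its next most general relation
    """
    false_conds = preconditions & known_false
    unknown = preconditions - known_true - known_false
    if not false_conds and not unknown:
        return "deduction"       # everything required is known to hold
    if not false_conds:
        return "assumption"      # gaps exist, but nothing is contradicted
    if all(siblings.get(c, set()) & known_true for c in false_conds):
        return "generalization"  # each false condition has a true sibling
    return "analogy"             # only analogical assumptions can bridge the gap


# Hypothetical preconditions for a liquid flow explanation.
LIQUID_FLOW = {"pressure_greater", "path_open", "substance_is_liquid"}
SIBLINGS = {"substance_is_liquid": {"substance_is_gas"}}

deduction = classify_scenario(LIQUID_FLOW, LIQUID_FLOW, set(), SIBLINGS)
assumption = classify_scenario(
    LIQUID_FLOW, {"pressure_greater", "substance_is_liquid"}, set(), SIBLINGS)
generalization = classify_scenario(
    LIQUID_FLOW, {"pressure_greater", "path_open", "substance_is_gas"},
    {"substance_is_liquid"}, SIBLINGS)
analogy = classify_scenario(
    LIQUID_FLOW, set(), {"pressure_greater", "substance_is_liquid"}, SIBLINGS)
```

The same function handles every case, which mirrors the argument that the scenarios differ in degree of support rather than in the process applied.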

Existing explanation systems suffer from the adaptability problem: they are unable to
offer a best guess in light of an imperfect domain theory and unable to apply knowledge of
one domain to the understanding of another. We can solve this problem by (1) generalizing
abduction to include analogical assumptions and (2) developing a unified architecture to
support such abduction. This motivates the following conjecture:




• Similarity conjecture: All interpretation-construction tasks may be characterized 
as the search for maximal, explanatory similarity between the situation being 
explained and some previously explained scenario. The previous situation may 
be drawn from an actual experience, a prototypical experience, or an imagined 
scenario derivable from general knowledge. 

In some sense this conjecture is conservative and plausible. Yet historically AI sys- 
tems have not operated this way. Part of the reason has been the view that identifying 
similarity is computationally intractable (Hayes-Roth & McDermott, 1978; Winston, 1980; 
Kline, 1983; Greiner, 1986). This thesis counters that claim with an efficient and flexi-
ble algorithm for performing similarity matches, implemented in a program called SME. A
consequence of the similarity conjecture is that the strong distinction between deductive
explanation processes and analogical explanation processes should be eliminated. While
the two are phenomenologically distinguishable, this does not necessarily imply a distinction in the
process model. The same basic processes may be used in each explanation scenario and the 
distinctions between them correspond to how well existing knowledge supports the expla- 
nation. Everyday, common deductive operations correspond to the high confidence derived 
from identicality matches. We can reformulate the traditional abduction processes as being 
special cases of analogical explanation. 

A corollary to the similarity conjecture is that the same basic processes are used in 
both scientific theory formation and everyday interpretation and hypothesis formation. 
The famous, analogy-induced discoveries chronicled in the history of science literature are 
rare because (1) science is difficult and (2) they represent analogical reasoning in its purest 
and most difficult form - the sifting through irrelevant details to recognize similarities that 
lay hidden and thus reformulate a problem in an entirely new light. A scientific theory 
is distinguishable from everyday conjectures by its degree of specificity in accounting for 
phenomena and by how carefully it is analyzed. However, there is nothing fundamentally 
different in the basic process (see (Leatherdale, 1974) for a similar claim).

The benefit of this view is a single computational architecture for explanation pro-
cesses. Distinctions between explanation types only influence weighing of evidence and
deciding whether a new conjecture represents a revision of existing knowledge or a new
separate body of knowledge. This thesis seeks to demonstrate the feasibility of this view by
developing a system that uses analogical similarity to focus the search for explanations and
develop novel theories when existing knowledge is insufficient. It should not be construed
as a claim that the explanation problem has been solved by using analogy. Rather, it is
intended as an important first step towards that goal.

1.2 Analogy



The primary research topic of this thesis is the analogy process, with explanation in phys-
ical domains the application task. How are analogies initially recognized? How are they
elaborated to sanction new inferences about a novel situation? What is analogy's role as
an aid to a more global performance element? In particular, how may it be applied to the
task of providing plausible explanations of physical phenomena?

Analogy may be described as having three functional blocks: given a current target
situation (1) access retrieves another description, the base, from memory which is analogous
or similar to the target in some respects, (2) mapping isolates a set of correspondences
between the base and target and uses them to support the transfer of additional base
knowledge to the target, and (3) evaluation and use estimates the quality of the analogy
and uses it for some performance task.

A central point of this thesis is that analogy is often an active, iterative process of 
hypothesis formation, testing, and revision. Previous accounts of analogy have focused on 
matching two descriptions to produce new knowledge. My account goes beyond these by
explicating the role of analogy in explanation and learning. This thesis explores several new 
questions in analogical reasoning and learning. How is an analogy used once it is estab- 
lished? What role does analogy play during an extended period of reasoning and problem 
solving? Analogy may sometimes be an atomic, "single inference" process. However, often 
it is not. 

1.2.1 Access 

In previous models of analogy, one starts with a partially understood model of a domain (or
a teacher-supplied analogical hint which serves the same purpose). This incomplete model
is then used to gain access to a fully understood analogue and to constrain the mapping
from the analogue to the current model. Analogical learning situations pose an interesting
problem - when entirely new theories are developed, or knowledge of the current domain
is strongly underspecified, such a model of analogy fails to function due to lack of infor-
mation. This led Burstein (1986, pg. 352) to claim that matching "cannot be the basis
of a general theory of learning by analogy". However, this precludes autonomous learning
(his system worked in the context of a tutor). Something must match to trigger the initial
recognition of similarity, and this match may then be used to sanction further inferences. I
claim that three types of information are generally available in autonomous learning situa-
tions: (1) knowledge of what information is desired and what purpose it should serve, (2)
correlations between available and desired information, and (3) abstractions characterizing
various aspects of the situation. Important strategies in accessing analogies are keying on
whatever knowledge is most readily available, is most complete, and is correlated to infor-
mation sought. For example, given a complete behavioral model and a scant causal model,
it would be wise to key on similar behavior rather than focusing solely on the causal model
of primary interest. The resulting reminding provides a candidate analogue and an initial
set of correspondences that constrain the mapping process.

This thesis adopts that strategy. Behavioral similarity is used to initiate and guide 
the analogy process. For example, consider constructing an explanation for the heat flow
situation depicted at the top of Figure 1.1 (a hot brick immersed in cold water). Little 
is known about thermal phenomena and what causes the two objects shown to change 
temperature the way they do. However, more is known about the behavior. This may be 
used to seek an experience (real, prototypical, or constructed) that demonstrates the same 
detailed type of behavior. Knowledge of the particular physical configuration, the domain, 
and any initial interpretations that fill in some gaps may be used to constrain the search 
and augment the description when the behavior is underspecified. For example, the way
the temperatures are asymptotically approaching each other suggests that perhaps a single 
exchange is taking place between the two objects. Suppose liquid flow is suggested based 
on how its behavior is similar to what is happening. This similarity suggests an initial set 
of correspondences between the two situations: the cooling brick to the source of liquid 
flow, the heating water to the destination of liquid flow, and temperature to pressure. We 
can then recall what explains the liquid flow behavior and attempt to map it to the current 
heat flow situation. 
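
The access step described above can be sketched as follows. This is a deliberately crude illustration under stated assumptions, not the retrieval mechanism used by PHINEAS: behaviors are reduced to hypothetical (entity, quantity, trend) triples, the `MEMORY` contents (including the boiling entry) are invented for the example, and similarity is just a count of shared trends.

```python
from collections import Counter

# Hypothetical memory of previously explained behaviors, each summarized as
# (entity, quantity, trend) triples. Contents are illustrative only.
MEMORY = {
    "liquid-flow": [("source", "pressure", "decreasing"),
                    ("destination", "pressure", "increasing")],
    "boiling": [("pot", "temperature", "constant"),
                ("pot", "amount", "decreasing")],
}


def behavioral_overlap(target, base):
    """Count the trends that two behavior descriptions have in common."""
    target_trends = Counter(trend for _, _, trend in target)
    base_trends = Counter(trend for _, _, trend in base)
    return sum((target_trends & base_trends).values())


def access(target):
    """Retrieve the most behaviorally similar base and propose an initial
    set of correspondences by pairing items that show the same trend."""
    name = max(MEMORY, key=lambda n: behavioral_overlap(target, MEMORY[n]))
    correspondences = {}
    for t_entity, t_quantity, t_trend in target:
        for b_entity, b_quantity, b_trend in MEMORY[name]:
            if t_trend == b_trend:
                correspondences[t_entity] = b_entity
                correspondences[t_quantity] = b_quantity
    return name, correspondences


# The hot brick immersed in cold water, described only by its behavior.
observed = [("brick", "temperature", "decreasing"),
            ("water", "temperature", "increasing")]
base_name, initial_map = access(observed)
print(base_name)    # liquid-flow
print(initial_map)  # {'brick': 'source', 'temperature': 'pressure', 'water': 'destination'}
```

The resulting pairs (brick to source, water to destination, temperature to pressure) are only a starting point; mapping is still needed to carry the causal explanation across.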

1.2.2 Mapping 

Once a candidate analogue has been identified, mapping serves to complete the initial set
of correspondences (matching) and propose inferences sanctioned by those correspondences
(carryover). The approach to mapping used in this thesis is a modification of Gentner's
(1980, 1983, 1988) structure-mapping theory of analogy. Structure-mapping provides rules
for analogical mapping which are based solely on the structural properties of domain de-
scriptions, rather than on their content. Furthermore, it introduces the systematicity prin-
ciple, which states that the amount of common higher-order relational structure determines
which of several possible matches is preferred.
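
The systematicity preference can be illustrated with a small scoring function. This is a rough sketch under several assumptions and is not the SME algorithm: expressions are nested tuples whose first element is the relation name, relations must match identically, and a mapping's score is the summed order of every base expression whose translation appears in the target. The function names and the tank/cup example are hypothetical.

```python
def order(expr):
    """Order of an expression: 0 for an entity, otherwise one more than the
    highest order among its arguments."""
    if isinstance(expr, str):
        return 0
    return 1 + max(order(arg) for arg in expr[1:])


def translate(expr, mapping):
    """Rewrite a base expression into target terms using entity correspondences."""
    if isinstance(expr, str):
        return mapping.get(expr, expr)
    return (expr[0],) + tuple(translate(arg, mapping) for arg in expr[1:])


def subexpressions(expr):
    """Yield an expression and every non-entity expression nested inside it."""
    if isinstance(expr, str):
        return
    yield expr
    for arg in expr[1:]:
        yield from subexpressions(arg)


def systematicity_score(base, target, mapping):
    """Sum the orders of base expressions preserved in the target, so that
    shared higher-order structure counts for more."""
    target_exprs = {s for e in target for s in subexpressions(e)}
    return sum(order(s) for e in base for s in subexpressions(e)
               if translate(s, mapping) in target_exprs)


base = [("cause",
         ("greater", ("level", "beaker"), ("level", "vial")),
         ("flow", "beaker", "vial"))]
target = [("cause",
           ("greater", ("level", "tank"), ("level", "cup")),
           ("flow", "tank", "cup"))]

consistent = systematicity_score(base, target, {"beaker": "tank", "vial": "cup"})
swapped = systematicity_score(base, target, {"beaker": "cup", "vial": "tank"})
print(consistent, swapped)  # 8 2
```

The swapped mapping still preserves the two first-order `level` expressions, but only the consistent mapping preserves the `greater`, `flow`, and `cause` structure built on top of them, so it is preferred.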

Chapter 2 reviews the structure-mapping theory and describes limitations that have 
been found with it. These center around its assertion that relations must match identically 
and that only the structural properties of domain descriptions determine the mapping. No
domain-specific or problem-specific information is used by the mapping process. It would
have difficulty, for example, establishing a correspondence between the cylindrical shape
of one liftable cup and the handle of another liftable cup.^2 It could not reason about the
content of the situation descriptions to see that the two, non-identical expressions provide
the same function.

[Figure 1.4 diagram: situation descriptions for the Disappearing Alcohol Scenario, including
Container(beaker2), and the Dissolving Scenario, including Container(glass4).]

Figure 1.4: Potentially analogous situation descriptions. In one, alcohol left sitting in an
open beaker is disappearing. In the other, salt is dissolving in a glass of water.

This thesis adopts a knowledge-intensive view of the structure-mapping paradigm, called 
contextual structure-mapping, which is described in Chapter 2. It uses knowledge of the 
representations being matched and the current reasoning task to help constrain and guide 
the match. 

^This is a recurring example in the machine learning literature, originating in (Winston et al., 1983). 

Suppose we observe a situation in which alcohol left sitting in an open beaker is 
disappearing. Further suppose that our knowledge of physical phenomena is limited to various 
kinds of liquid flows, heat flows, boiling, and dissolving. We might propose that processes 
like liquid flow or boiling were taking place. Consider how a third possibility, salt dissolving 
in a glass of water, may be used to propose an explanation for the alcohol's disappearance. 
The initial descriptions for the alcohol situation (the target) and the dissolving situation 
(the base) are shown in Figure 1.4. Comparing these situations, two interpretations appear 
to be possible. In one, glass4 and water1 correspond to beaker2 and alcohol1, respectively. 
In the other, salt1 would correspond to alcohol1, since they are both decreasing. 
Systematicity, the preference to maximize the amount of shared relational structure, 
provides little guidance and would appear to prefer the former interpretation. However, the 
purpose of the analogy is to explain the observed behavior, and only the second 
interpretation provides a comparison which includes the behavior. In contextual 
structure-mapping, knowledge of current reasoning goals, such as the preference to focus on 
features being explained, combines with systematicity to influence which matches are 
preferred. 
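
The interaction between reasoning goals and systematicity in this example can be sketched in a few lines of Python. This is only an illustrative sketch, not the matcher described in this thesis: the `Interpretation` record, the integer `shared_relations` score, the predicate strings, and the rule that goal coverage is compared before structural score are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interpretation:
    name: str
    pairs: frozenset        # (base item, target item) correspondences
    shared_relations: int   # crude stand-in for a systematicity score


def covers_goal(interp, goal_items):
    """True if some matched target item is one we are trying to explain."""
    return any(target in goal_items for _, target in interp.pairs)


def preferred(interps, goal_items):
    # Assumed ordering: goal coverage first, structural score second.
    return max(interps, key=lambda i: (covers_goal(i, goal_items), i.shared_relations))


containers = Interpretation(
    "containers",
    frozenset({("glass4", "beaker2"), ("water1", "alcohol1")}),
    shared_relations=2,
)
decreasing = Interpretation(
    "decreasing",
    frozenset({
        ("salt1", "alcohol1"),
        ("Decreasing(Amount-of(salt1))", "Decreasing(Amount-of(alcohol1))"),
    }),
    shared_relations=1,
)

goal = {"Decreasing(Amount-of(alcohol1))"}   # the behavior to be explained
best = preferred([containers, decreasing], goal)
by_structure_alone = max([containers, decreasing], key=lambda i: i.shared_relations)
```

Under these assumed scores, structure alone selects the container interpretation, while adding the goal preference selects the interpretation that includes the decreasing behavior.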

1.2.3 Transfer 

The role of mapping is to establish correspondence and identify base information potentially 
inferrable for the target. These candidate inferences are hypotheses which must pass a series 
of evaluative processes before being accepted as holding for the target domain. The first of 
these is transfer. Transfer is concerned with importing candidate inferences into the target 
domain and making them operational. This centers around ensuring consistent expression 
use and identifying unknown objects proposed by the mapping. The model of transfer 
developed in this thesis has two important characteristics. First, it provides an expanded 
account of the complexities in importing proposed analogical inferences into the target 
domain. Second, mapping and transfer may interact in an iterative manner by repeating 
the match in light of additional information found during transfer's examination of the 
proposed inferences. 

The salt1 to alcohol1 correspondence provides a partial, initial explanation by 
supporting the inference that something analogous to the Dissolving-Process may explain 
the alcohol's behavior. However, this process describes the interactions between salt1 and 
water1, yet there is no apparent correspondent for water1 in the alcohol scenario. 
Furthermore, predicate type restrictions are violated for some of the imported base relations being 
applied to target objects. For example, Immersed-in(alcohol1,?unknown) is improper, 
since the first argument to Immersed-in must be a solid object, while alcohol1 is a 
liquid. To resolve these problems, the carryover component must seek additional knowledge 
about the alcohol situation. To seek an alternative to Immersed-in, it first determines 
that Immersed-in was in the dissolving scenario to ensure physical contact between the 
salt and the water. Seeking objects having physical contact with the alcohol, we find that 
the atmosphere is Touching the alcohol due to the open beaker. 
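
The two problems detected here, a violated type restriction and an unknown object, can be illustrated with a small Python sketch. The tables `ARG_TYPES` and `OBJECT_TYPES` and both function names are hypothetical; only the restriction that the first argument of Immersed-in must be a solid is taken from the text.

```python
# Hypothetical type declarations. Only the "solid" restriction on the first
# argument of Immersed-in comes from the text; the rest is assumed.
ARG_TYPES = {"Immersed-in": ("solid", "any"), "Touching": ("any", "any")}
OBJECT_TYPES = {
    "salt1": "solid",
    "water1": "liquid",
    "alcohol1": "liquid",
    "atmosphere": "gas",
}


def violations(predicate, args):
    """Return the argument positions whose declared type is violated."""
    bad = []
    for pos, (arg, required) in enumerate(zip(args, ARG_TYPES[predicate])):
        if arg.startswith("?") or required == "any":
            continue  # unknown objects and unrestricted slots cannot violate
        if OBJECT_TYPES.get(arg) != required:
            bad.append(pos)
    return bad


def unknowns(args):
    """Return the arguments that still name unknown objects."""
    return [a for a in args if a.startswith("?")]


imported = ("Immersed-in", ("alcohol1", "?unknown"))
problems = violations(*imported)       # argument 0 is a liquid, not a solid
missing = unknowns(imported[1])        # the slot once filled by water1
```

A transfer step along these lines would flag the first argument as ill-typed and the second as unresolved, which are the two conditions that trigger the search for a substitute predicate and a correspondent object.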

Detecting and resolving these problems is an important component of attempting to use 
the results of mapping. Analogical inferences (e.g., Dissolving-Process and 
Immersed-in(alcohol1,?unknown)) represent information in the base that had no 
correspondent in the target description. These must be examined before being posited as new 
target knowledge. Since base relations are being imported, they may apply predicates to 
objects or propositions other than their conventional referents. Substitutes for such 
predicates must be found or created. These inferences may also contain unknown, anticipated 
objects: slots occupied in the base representation by objects that had no correspondent in 
the match. A corresponding target object must be found, or the existence of the unknown 
object postulated. 

Since a new object, atmosphere, and information about it have been added to the 
current working vocabulary of the target alcohol scenario, mapping should be repeated 
using this augmented description. Although not the case in this example, we could find 
that atmosphere is a better correspondent for salt1 than is alcohol1. 

Proposed inferences represent holes in the match. They need not represent new knowl- 
edge, but may simply indicate places where additional knowledge should be retrieved to 
help complete the analogy. Transfer conducts focused probes into memory to seek more 
information about places where the similarity match was incomplete. If additional 
knowledge is found, transfer will augment the existing base and target descriptions and iterate 
back to the mapping process, to see if the new information will affect the overall mapping. 
Mapping and transfer combine to form a map and analyze cycle to provide focus to the 
analogical mapping process. Were we to retrieve everything we possibly knew about the 
base and target prior to mapping, a great deal of time might be spent on unconstrained 
inferencing. By initiating the match on what appears to be relevant from immediately 
available information, transfer may focus on seeking the specific information the match 
indicates is lacking. 
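
The map and analyze cycle can be pictured as a loop that alternates matching with focused retrieval. The Python sketch below is a deliberately simplified illustration under stated assumptions: descriptions are reduced to dictionaries from hypothetical role names to objects, `match` is a toy stand-in for structure-mapping, and `memory` is a lookup table standing in for probes into long-term memory.

```python
def match(base, target):
    """Toy matcher: pair objects that fill the same role; report unfilled roles."""
    mapping = {base[role]: target[role] for role in base if role in target}
    gaps = [role for role in base if role not in target]
    return mapping, gaps


def map_and_analyze(base, target, memory, max_rounds=5):
    """Alternate matching with focused retrieval until nothing new is found."""
    target = dict(target)
    mapping, gaps = match(base, target)
    for _ in range(max_rounds):
        # Probe memory only for the roles the match left unfilled.
        found = {role: memory[role] for role in gaps if role in memory}
        if not found:
            break
        target.update(found)
        mapping, gaps = match(base, target)   # repeat the match
    return mapping, gaps


# Role names are hypothetical labels invented for this example.
base = {"shrinking-substance": "salt1", "in-contact-with": "water1"}
target = {"shrinking-substance": "alcohol1"}
memory = {"in-contact-with": "atmosphere"}

mapping, gaps = map_and_analyze(base, target, memory)
```

The first match leaves one role unfilled, retrieval supplies the atmosphere for that role, and the repeated match pairs water1 with atmosphere with no gaps remaining.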

Comparing the original dissolving situation with the augmented disappearing alcohol 
scenario, we find that water1 corresponds to atmosphere, while salt1 corresponds to 
alcohol1 as before. In the process, we must place Immersed-in(salt1,water1) in 
correspondence with Touching(alcohol1,atmosphere), since both ensure physical contact. This 
second match is complete enough to enable consistent transfer of the dissolving explanation 
into the evaporation situation. There are no unknown objects in the proposed explanation 
and all predicate instances are consistent with their declared usage. 

This last phase demonstrates an extension to structure-mapping theory's identicality 
requirement. Knowledge about the representations is used when finding correspondences 
by stating that items may match if they are functionally analogous. Items are functionally 
analogous if they provide the same inferential support in the context of the structures being 
matched. This enables Immersed-in to match Touching. In the cups example described 
above, it would find that the first cup's cylindrical shape corresponds to the second cup's 
handle by virtue of their both supporting the common constraint of being liftable. 

1.2.4 Verification-Based Analogical Learning 

Evaluating the quality of a proposed analogy and using it for some performance task is 
a crucial phase of the analogy process. Are the proposed inferences correct, likely to be 
correct, or consistent? Because of the approximate nature of the process, the proposed 
inferences must be examined to be sure they even predict the very situation they are 
intended to explain. Most models avoid this validity issue by either stopping once 
inferences are produced (Holyoak & Thagard, 1988b; Kass et al., 1986) or requiring deductive 
analogies in which the results could have been achieved (more slowly) without the 
analogy (Kling, 1971; Carbonell, 1983a; Kedar-Cabelli, 1985b). Inferences produced by analogy 
must be questioned. This is reflected in the following requirement: 

• Verification requirement: When possible, the results of analogy must be tested 
empirically and against other knowledge. 

Verification-based analogical learning (VBAL) is designed specifically to satisfy this 
requirement. In VBAL, analogical learning is seen as an iterative process of hypothesis 
formation, verification, and revision, centered around the requirement to confirm accuracy 
and increase the likelihood of being correct. It relies on analogical inference to propose 
explanations and gedanken experiments (i.e., simulation) to analyze their validity. 

Consider the floating beachball behavior introduced in Figure 1.1 and repeated in Fig- 
ure 1.5. Two hypotheses appear to be possible. One proposes that the air is pushing on the 
ball from either side, holding it in place. The other recalls that air flowing over an object 
can have a pulling effect (due to a pressure gradient), as in the lift provided an airplane 
wing. How may we evaluate these two hypotheses? Having passed all the tests so far 
(i.e., correlated information selected during access and local consistency checks performed 
during mapping), the best thing to do is try them out. Do they generate a complete and 
consistent explanation? If not, then hypothesis generation and revision must continue. 

Figure 1.5: A beachball suspended in a vertical column of flowing air is extremely stable. 
Is the flowing air pushing on both sides, or is it "pulling" on each side? 

In this case, we find that while both proposals are locally consistent (i.e., no direct 
contradictions with existing knowledge), only one of them consistently predicts the observed 
behavior. According to the "pushing" explanation, offsetting the ball to the right results in 
increased force towards the right due to the additional air flow on the left. This violates the 
stability aspect of the observed behavior. The "pulling" explanation, however, consistently 
predicts the observed behavior. Offsetting the ball to the right results in increased force to 
the left, which draws it back into the center of the air stream. 
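
The test applied to the two beachball hypotheses amounts to running each one on a small displacement and checking the predicted response against the observed stability. The Python sketch below illustrates that idea only; it is not the qualitative simulator used in this work. Forces and offsets are reduced to signs (positive meaning rightward), and the function names are assumptions.

```python
def pushing_force(offset):
    """'Pushing' hypothesis: the net force acts in the direction of the offset."""
    return 1 if offset > 0 else -1 if offset < 0 else 0


def pulling_force(offset):
    """'Pulling' hypothesis: the net force acts against the offset."""
    return -1 if offset > 0 else 1 if offset < 0 else 0


def predicts_stability(force):
    # Stable means the predicted force opposes every small displacement.
    return all(force(dx) * dx < 0 for dx in (-1, 1))


hypotheses = (pushing_force, pulling_force)
surviving = [h.__name__ for h in hypotheses if predicts_stability(h)]
```

Only the pulling hypothesis predicts a restoring force for displacements in both directions, so it is the only one that survives the check.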

Had both hypotheses failed, we would be faced with the revision problem: how to refine 
the results of an analogy when it is found to be awry. Analogy for problem solving is 
intimately tied with theory revision. It produces approximate, heuristic conjectures that 
may need adjustment. This thesis depicts the revision stage as being roughly equivalent 
to the original generation of explanatory hypotheses. It considers behavior analogous to 
that which led to the need for revision and considers how the current anomalous situation 
differs from prior situations that were consistently explained. In this manner, revision 
starts the whole process over again - what might be causing the anomaly, is it similar to 
other observed behaviors, and so forth. 

1.3 Physical Analogies 

A causal model of a physical system may be analyzed to give rise to a sequential description 
of what will happen in the world. For example, Figure 1.6 shows a simple model for a 
spring-block oscillator and the physical behavior it predicts. The predicted behavior may then 
be compared to actual observation (Figure 1.7), to verify that the continuous operation of 
the oscillator corresponds to stepping through the eight predicted states. Thus, physical 
analogies, such as "Heat is like water", have the unique characteristic that they relate 
runnable models (Waltz, 1981; Gentner & Stevens, 1983), that is, one may actually envision 
heat "flowing" from one location to another. One may reason about them, make predictions, 
and observe their results. Furthermore, models of physical systems may be decomposed 
into different perspectives of the same phenomenon. For example, a behavioral model of 
water flow would describe the physical action of water flowing, while a causal model for 
water flow would relate the activation of flow to the inequality of pressures. 

Figure 1.6: A qualitative model for a spring-block oscillator and its corresponding 
envisionment. 

Physical models are defined to be models about physical phenomena which have the 
two following characteristics. First, the models being compared must be generative, that 
is, runnable and applicable to new, unforeseen situations. This means that systematic 
analysis of the model will produce measurable predictions for the physical world. Second, 
the models must be decomposable into different perspectives of the same phenomenon. 
Thus, the same physical system may have a behavioral model, a causal model, a structural 
model, and a quantitative model. Physical analogies are defined to be analogies which 
relate physical models. In this work, I focus on learning causal models. A causal model 
may be defined as a collection of axioms or rules for some domain which explains, by causal 
argument, the behavior of the domain. 



1.3.1 Qualitative Physics 

An interesting type of model is a naive or qualitative model of physics. People are able 
to use common sense knowledge to understand and predict everyday physical occurrences 
such as throwing a ball, boiling a pot of water, or breaking a glass bowl. They are able 
to reason about these occurrences in simple, qualitative terms, without need of a formal, 
mathematical physics. For example, upon encountering a pot of water on a hot stove, a 
person would be able to predict that "the water will heat up and eventually begin to boil." 
They are able to use purely qualitative terms, without knowledge of the precise quantities 
or durations involved. 

A qualitative simulator takes a model of a particular physical configuration and produces 
a description of the possible behaviors for the given situation, called an envisionment. An 
envisionment describes physical states and the possible transitions between them. The 
behavior of the system through time may then be represented as a single path through the 
envisionment, with each state representing an interval of time in which behavior does not 
change. Such a path will be called a history, after (Hayes, 1979). 
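
An envisionment can be viewed as a directed graph of qualitative states, and a history as one path through it. The Python sketch below illustrates this reading with a hypothetical four-state envisionment for a pot of water on a stove; the state names and transitions are invented for the example and are not the output of any simulator.

```python
# Hypothetical envisionment: each state maps to its possible successor states.
ENVISIONMENT = {
    "cold": {"heating"},
    "heating": {"boiling"},
    "boiling": {"empty"},
    "empty": set(),
}


def is_history(path, envisionment):
    """A history is a path whose consecutive states are linked by transitions."""
    return all(b in envisionment.get(a, ()) for a, b in zip(path, path[1:]))


def histories(start, envisionment):
    """Enumerate every maximal path from start (assumes an acyclic graph)."""
    successors = envisionment[start]
    if not successors:
        return [[start]]
    return [
        [start] + rest
        for nxt in sorted(successors)
        for rest in histories(nxt, envisionment)
    ]


observed = ["cold", "heating", "boiling"]
consistent = is_history(observed, ENVISIONMENT)
all_histories = histories("cold", ENVISIONMENT)
```

Because this toy graph has a single chain of transitions, it admits exactly one maximal history, and any observed sequence that skips a state is rejected.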

This thesis uses Forbus' Qualitative Process Theory (1981, 1984) as the primary 
formalism to represent and reason about change in the physical world. In QP theory, physical 
changes such as moving, colliding, bending, heating up, and cooling down are thought of as 
processes. A key tenet of Forbus' theory is that processes are central to human knowledge 
about physical domains (Forbus & Gentner, 1983). In QP theory, a situation is represented 
as a collection of objects, a set of relationships between them, and a set of processes which 
account for all changes in the world. Each object has a set of continuous parameters, such 
as TEMPERATURE and PRESSURE, which are represented as quantities. Each quantity has an 
amount, as in A[TEMPERATURE(brick)], and a derivative, as in D[TEMPERATURE(brick)]. 
Amounts and derivatives are constrained by the inequality relations which hold among the 
different quantities (the quantity space). 

Figure 1.7: The behavior of a frictionless spring-block oscillator. 

Figure 1.8: A QP Theory definition of the liquid flow process. 

Process definitions have five components: individuals, preconditions, quantity 
conditions, relations, and influences. The individuals specify what objects would be involved in 
the process if it were active, the preconditions and quantity conditions indicate when the 
process will be active, and the relations and influences specify what relations will hold while 
the process is active. Preconditions are changeable conditions that cannot be predicted in 
terms of change to continuous quantities. Quantity conditions correspond to changes in 
the relative relationship between two quantities and are predictable from the qualitative 
model. A typical QP theory definition for the water flow process is shown in Figure 1.8. 
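
The five components of a process definition can be pictured as fields of a record. The Python sketch below is an illustrative data structure only, not a transcription of Figure 1.8 or of the QP implementation: the field contents, the situation dictionary, and the use of numbers to stand in for ordinal relations between quantities are all assumptions.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Process:
    name: str
    individuals: List[str]                              # objects taking part
    preconditions: List[Callable[[Dict], bool]]         # not predictable from quantities
    quantity_conditions: List[Callable[[Dict], bool]]   # inequalities between quantities
    relations: List[str]                                # hold while the process is active
    influences: List[Tuple[str, str]]                   # (quantity, "+" or "-")

    def active(self, situation: Dict) -> bool:
        """A process is active when all of its conditions hold."""
        checks = self.preconditions + self.quantity_conditions
        return all(check(situation) for check in checks)


# Hypothetical liquid-flow-like process, invented for illustration.
flow = Process(
    name="liquid-flow",
    individuals=["source", "destination", "path"],
    preconditions=[lambda s: s["path_open"]],
    quantity_conditions=[
        lambda s: s["pressure"]["source"] > s["pressure"]["destination"]
    ],
    relations=["flow-rate depends on the pressure difference"],
    influences=[("amount-of(source)", "-"), ("amount-of(destination)", "+")],
)

flowing = flow.active({"path_open": True, "pressure": {"source": 2, "destination": 1}})
```

An individual view could be represented with the same record by leaving `influences` empty.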

An additional type of QP theory description is the individual view. Individual views 
represent various configurations and states of objects that may be subject to dynamical 
conditions (e.g., a spring may be stretched, relaxed, or compressed). They are expressed 
in the same manner as processes (i.e., individuals, preconditions, quantity conditions, and 
relations), but differ in that they have no influences. 

A central goal of this thesis is the use of analogy for the development of theories about 
the physical world. Using the QP formalism, a "theory" consists of a set of process 
descriptions (implemented using defProcess), individual view descriptions (defView), entity 
descriptions (defEntity), and atomic facts. 

Interpreting observations of the behavior of a physical system may be performed by 
finding a path through a total envisionment for the system which corresponds to 
observations. Forbus (1986a) presents such a theory of measurement interpretation (ATMI) 
and describes the implementation. It takes an envisionment about the scenario and a set 
of time-ordered measurements. A model is able to provide an explanation for the 
observations if a path through the envisionment it produces can be found which corresponds to the 
measurements. For example, the plot of Figure 1.7 would be divided into temporal 
intervals of qualitatively equivalent measurements (e.g., velocity > 0 and increasing) and then 
compared to the eight-state description of Figure 1.6. When measurement interpretation 
fails, it must be because the measurement sampling was incomplete, the physical system is 
broken, or the qualitative domain model was incorrect. This work addresses the problem 
of incomplete and inconsistent domain models and does not address the sampling problem 
or system troubleshooting. 
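
The two steps just described, dividing measurements into qualitatively equivalent intervals and then looking for a matching path, can be sketched in Python as follows. This is a rough illustration and not the ATMI algorithm: a qualitative state is reduced to a pair (sign of the value, sign of its change), instants such as zero crossings are ignored, and the four-state `OSCILLATOR` graph is a hypothetical simplification rather than the eight-state envisionment of Figure 1.6.

```python
def sign(x):
    return (x > 0) - (x < 0)


def qualitative_intervals(samples):
    """Collapse consecutive samples into (sign of value, sign of change) segments."""
    segments = []
    for prev, cur in zip(samples, samples[1:]):
        state = (sign(cur), sign(cur - prev))
        if not segments or segments[-1] != state:
            segments.append(state)
    return segments


def explains(envisionment, segments):
    """True if the segments trace a path through the envisionment."""
    if any(s not in envisionment for s in segments):
        return False
    return all(b in envisionment[a] for a, b in zip(segments, segments[1:]))


# Hypothetical envisionment over (velocity sign, acceleration sign) states.
OSCILLATOR = {
    (1, 1): {(1, -1)},
    (1, -1): {(-1, -1)},
    (-1, -1): {(-1, 1)},
    (-1, 1): {(1, 1)},
}
# A deliberately wrong model that lacks the slowing-down state.
BROKEN = {(1, 1): {(-1, -1)}, (-1, -1): {(1, 1)}}

velocity = [1, 2, 3, 2, 1, -1, -2]
segments = qualitative_intervals(velocity)
```

With these inputs the measurements collapse to three segments, which the first model accounts for and the second does not, corresponding to the case in which interpretation fails because the domain model is incorrect.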

1.4 PHINEAS 

The basic model of analogy described in Section 1.2 is instantiated in PHINEAS, a fully 
implemented program that offers qualitative explanations of time-varying physical behaviors. 
It relies on analogical inference to hypothesize new theories, and uses qualitative simula- 
tion as a form of gedanken experiment to analyze their validity. It uses an iterative process 
consisting of four primary stages (Figure 1.9): 

1. Behavior match. Behavioral similarity is used to initiate and guide analogy. A new 
observation triggers a search for previously understood experiences that exhibited 
analogous behavior. Abstractions of the observed situation and its behavior are used 
to provide indices into memory. The result of this stage is a candidate analogue and 
an initial set of correspondences between the two domains that serve to guide the 
mapping process. 

2. Theory generation. The objective of the second stage is to produce a fully opera- 
tional initial hypothesis about the current domain. This has two components. First, 
the models used to explain analogous aspects of the recalled experience are retrieved 
and analogically mapped into the current domain. This mapping is guided by the 
initial correspondences found in the behavioral comparison. Second, to operationalize 
the model, any unknown entities and properties it requires must be inferred from the 
domain theory or their existence must be postulated. 

3. Gedanken analysis. The operational model is used to construct an explanation 
of the present observation. It is used to generate an envisionment of the scenario, 
which is then compared to the original observation. If the explanation is consistent 
and complete, then the explanation is considered successful. If the explanation is 
inconsistent or fails to provide complete coverage, then revision is required. 

Figure 1.9: Functional decomposition of PHINEAS. 



4. Revision. If an initial hypothesis fails, or an old hypothesis is inadequate for a new 
situation, an attempt is made to adapt it around points of inaccuracy. Revision relies 
on past experiences to guide the formation and selection of revision hypotheses. It 
considers behavior analogous to the current anomaly and considers how the current 
anomalous situation differs from prior situations that were consistently explained. 

The revision module is only partially implemented. Every other aspect of PHINEAS is 
fully implemented and tested on over a half dozen examples. 

PHINEAS demonstrates how analogical reasoning and learning can be used in scientific 
investigation or everyday physical interpretation. Good scientific investigation involves 
asking the right questions, questions which are motivated by highly plausible conjectures 
and potentially fruitful lines of inquiry. Analogy provides a frame of reference from which 
to draw expectations and motivate such questions. It can provide an initial foothold into 
a new domain, which may then be debugged and eventually stand on its own. 

1.5 Reader's Guide 

This thesis presents a detailed study of the role of analogy in explanation and learning, 
focusing on qualitative explanation of physical phenomena, and an implemented program, 
called PHINEAS, which demonstrates the feasibility of the approach. 

Chapter 2 describes a general framework for analogical processing, its components, and 
its roles in reasoning. It then reviews Gentner's structure-mapping theory and discusses 
the technical limitations that have been found with it. This is followed by a presentation 
of contextual structure-mapping, which addresses these limitations. The chapter includes a 
number of specific problems in analogy research which need addressing. 

A prerequisite to general analogical processing is the ability to detect similarity by 
identifying correspondences. Chapter 3 describes the structure-mapping engine (SME), a 
general tool for performing various types of analogical matchings. The SME algorithm is 
briefly reviewed (see (Falkenhainer et al., 1987) for a complete description), followed by 
a detailed description of how it is configured to model the contextual structure-mapping 
process. The program is then analyzed from both analytical and empirical perspectives. 

The next four chapters sequentially discuss how each step in the analogy process is 
performed. Each of these chapters is accompanied by working examples from PHINEAS 
demonstrating that phase of the process. 

Chapter 4 describes the approach to access, which focuses on analogous behaviors to 
guide the formation of causal explanations. 

Chapter 5 describes the mapping and transfer phase of the analogy process. It explains 
how analogy may be used to propose novel theories and the various technical difficulties 
that arise when knowledge is ported from one domain to another. 

Chapter 6 discusses verification-based analogical learning and the importance of the 
evaluation and use phases of analogical processing. It reviews the process of measurement 
interpretation, which is in charge of determining if a proposed explanation accurately pre- 
dicts the observed behavior. The chapter shows how VBAL serves three roles in analyzing 
proposed explanations: as confirmation, as discrimination, and as analysis of coverage. 

Chapter 7 discusses how initial, flawed hypotheses may be revised into complete, consis- 
tent explanations. Unlike the other chapters in this thesis, which are fully implemented, this 
chapter describes a proposed approach to revision, which has yet to be fully implemented. 

The PHINEAS implementation and how the different program modules actually combine 
to produce coherent behavior are described in more detail in Chapter 8. Chapter 9 
demonstrates the operation of PHINEAS with a number of detailed examples. It provides a 
complete description of each example from start to finish, unlike the piecewise examples of 
the preceding chapters. Chapter 10 concludes with a summary, discussion of related work, 
and suggestions for future research. 

Chapter 2 



Similarity Comparisons and 
Analogical Processing 

Explanation through physical analogies requires a robust model of the analogy process. This 
chapter establishes the requisite terminology, describes a general framework for analogical 
processing, what its components are, and the requirements of a comprehensive model. It 
begins by examining the different aspects of analogy and its various roles in reasoning. It 
then reviews Gentner's structure-mapping theory, the primary source for the definition of 
similarity used in this thesis, and discusses some of its limitations. Contextual 
structure-mapping, an adaptation of Gentner's theory which addresses these limitations, is then 
presented. Finally, some general problems in analogy research are discussed. 

2.1 The Analogy Process 

Analogy is a mapping between one domain description (the base) and another (the target) 
that identifies a correspondence between their respective bodies of knowledge. Not all 
mappings represent analogies and part of the analogy research objective must be to define 
what mappings will be considered analogical. Each domain description consists of a set 
of objects and a set of related facts about those objects. Analogy then matches a base 
description with a target description of varying levels of completeness. When knowledge 
of both domains is equal, analogy indicates where the correspondences are and has no 
predictive power. Typically, the target will lack certain relations and analogical inference 
may occur; relations which hold in the base but are not currently known to hold in the 
target are mapped into the target domain if sanctioned by existing correspondences. Hence, 
analogy is most useful when familiar knowledge may be mapped to an unfamiliar situation, 
providing the necessary machinery to generate further inferences. 

Analogy is a complex problem, and the appropriate decomposition is critical. Without 
a good decomposition, it is easy to contend with several semi-independent problems at 
once and become lost in the space of possible mechanisms. A number of researchers have 
recognized that analogy may be divided into at least the following three subprocesses (e.g., 
Clement, 1981; Gentner & Landers, 1985; Gentner, 1988; Kedar-Cabelli, 1985a; Hall, to appear; 
Falkenhainer, 1986): 

1. Access. Given a current target situation, the first role of access is to retrieve from 
long-term memory a body of prior knowledge, the base, which is analogous or similar 
to the target. Second, it should attempt to garner those features of the base 
that are pertinent to the analogy, by examining it in light of the current reasoning 
context. These two aspects may occur simultaneously. Here we will define the first 
aspect as retrieval and the second as relevant feature selection. Using the terminology 
of Clement (1981, 1982), both aspects of access are required for spontaneous analogy: 
analogies which occur without prompting. When the base and target are explicitly 
given, or when hints are provided (provoked analogy), the access stage may be reduced 
or eliminated entirely. 

2. Mapping. The mapping stage consists of identifying a coherent set of analogical 
correspondences between the base and target (matching component), and possibly 
inferring additional base knowledge for the target (carryover component). Matching 
may involve extending previously existing correspondences. This match may also 
sanction a set of candidate inferences that identify additional knowledge possibly 
transferable to the target. The quality of a match may be a function of several factors: 
similarities, differences, and the amount and type of new knowledge or insight the 
analogy provides. Not all models distinguish between matching and inference, in 
which case mapping consists solely of carrying over a set of relevant base relations. 

3. Evaluation and Use. The validity of the match and the inferences it sanctions 
must be examined through further reasoning processes, such as problem solving, 
consistency verification, or experimentation. Evaluation of validity may occur as 
a separate verification process, as a side-effect of use in a more global reasoning 
context, or as some combination of the two. Cooperative interplay may occur between 
mapping's process of creating inferences and attempts by the performance element 
to evaluate and use the inferences. 

As a functional specification, rather than a modular decomposition, the above three 
components specify the general tasks that any treatment of analogy must address. Dif- 
ferences among alternate treatments typically correspond to where the modularity line 
between the tasks is drawn, how the tasks interact, and how each task is actually per- 
formed. It is generally agreed that the mapping stage is central to analogy, and most 
researchers have focused on this component. 

Another important distinction for analogy is based on its intended use and thus what 
functionality a model of analogy must provide. Different uses may be distinguished by the 
types of inferences they draw. Take $\mathrel{|\!\sim}$ to denote the analogical inference 
operation, while a statement of analogical correspondence will be represented by the 
$\sim$ operator. We may then describe analogical inference as being of the form 

$$
\begin{array}{l}
P_b \wedge Q_b \;\wedge \\
P_t \wedge \mathrm{Unknown}[Q_t] \\
\mathrel{|\!\sim}\; Q_t
\end{array}
$$

Given that sets of known facts, $P_b$ and $P_t$, are the same or similar ($P_b \sim P_t$), and that a 
set of facts, $Q_b$, are known to hold while the status of a related set of facts $Q_t$ is currently 
not known (but perhaps may be derived), infer that $Q_t$ also holds (under whatever 
transformations allowed $P_b$ and $P_t$ to match).^1 The statement of similarity may be given (e.g., 
Burstein, 1983; Greiner, 1988), or autonomously derived. Other constraints are generally 
required, such as that $Q_t$ is consistent and that $Q_b$ be somehow correlated to $P_b$. I will 
return to this issue in Section 2.4.4. 
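
Read operationally, this form says that base facts are carried across the correspondences and proposed for the target wherever their status is not already known. The Python sketch below illustrates that reading under simplifying assumptions: facts are tuples of a predicate followed by object names, the correspondence is a plain dictionary over objects, and the object and predicate names are hypothetical. It omits the consistency and correlation constraints mentioned above.

```python
def substitute(fact, mapping):
    """Rewrite a base fact (predicate, arg, ...) in target vocabulary.

    Base objects with no correspondent become unknowns marked with '?'.
    """
    predicate, *args = fact
    return (predicate, *[mapping.get(a, "?" + a) for a in args])


def candidate_inferences(q_base, mapping, target_facts):
    """Carry base facts across the mapping; keep those not already known."""
    carried = {substitute(f, mapping) for f in q_base}
    return carried - set(target_facts)


# Hypothetical liquid-flow base and heat-flow target.
mapping = {"tank": "brick", "cup": "water"}
q_base = {("flows-from-to", "tank", "cup"), ("connects", "pipe", "tank", "cup")}
target_facts = {("flows-from-to", "brick", "water")}

proposed = candidate_inferences(q_base, mapping, target_facts)
```

Here the flow fact is already known in the target and is dropped, while the connection fact is proposed with an unknown object standing in for the unmatched pipe.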

Borrowing from Indurkhya (1987), we may formally classify two types of analogical 
inference: 

Definition 2.1 A set of analogical inferences are deductively sound (d-sound) if every 
sentence being mapped is logically entailed by the target. The inferences are inductively 
sound (i-sound) if they are merely consistent with the target, rather than entailed by it.^2 
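
The distinction in Definition 2.1 can be made concrete for a very small logic. The Python sketch below assumes target knowledge given as string literals plus Horn rules, takes entailment to mean membership in the forward-chained closure, and takes consistency to mean that the negation is not in that closure. The label "unsound", the "not " prefix convention, and the example facts are assumptions introduced for the illustration.

```python
def closure(facts, rules):
    """Forward-chain Horn rules (premises, conclusion) to a fixed point."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and set(premises) <= known:
                known.add(conclusion)
                changed = True
    return known


def negate(literal):
    return literal[4:] if literal.startswith("not ") else "not " + literal


def classify(inferences, facts, rules):
    """Return the strongest label that applies to a set of inferences."""
    known = closure(facts, rules)
    if all(i in known for i in inferences):
        return "d-sound"      # every inference is entailed by the target
    if all(negate(i) not in known for i in inferences):
        return "i-sound"      # merely consistent with the target
    return "unsound"          # contradicts the target


facts = {"liquid(alcohol1)", "open(beaker2)"}
rules = [
    (("open(beaker2)",), "touching(alcohol1, atmosphere)"),
    (("liquid(alcohol1)",), "not solid(alcohol1)"),
]

entailed = classify(["touching(alcohol1, atmosphere)"], facts, rules)
conjectured = classify(["evaporating(alcohol1)"], facts, rules)
contradicted = classify(["solid(alcohol1)"], facts, rules)
```

In this toy setting the touching fact is derivable and so d-sound, the evaporation conjecture is new but consistent and so i-sound, and the claim that the alcohol is solid contradicts the closure.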

Thus, some analogical inferences may be seen as knowledge which exists for the target 
domain, yet if not explicitly stated (i.e., search or inference would be reqmred to retrieve it). 

*Thi« is a fcirly tUndard deflaition, bat difTen from that ued by Gteinet (1938), whose leatning model 
teqoiies that the status of onknown hold ovet the deductive elosnre. 

**Indurkhya (1987) uses coherent and its subset, strongly coherent, to refer to the entire analogical
mapping (matches plus inferences). I adopt d-sound and i-sound because of the alternate meanings of
"coherent" in the literature. This represents a deviation from (Falkenhainer, 1986), which used the term
"coherent".
Other analogical inferences represent new knowledge - conjectured facts about the target
domain. This is an important distinction for categorizing the different uses of analogy: 

Similarity-Based Generalization. In analogy, one may use the similarity between two 
concepts found during the matching stage to form a single, generalized concept. 
This corresponds closely to many empirical (often called similarity-based) learning
methods, some of which use pattern matching techniques to detect similarities
in the structural representation of features (e.g., Hayes-Roth & McDermott, 1978;
Michalski, 1980; Hoff et al., 1982; Skorstad et al., 1988).

Analogical Reasoning. Analogical reasoning is concerned with improving the effective 
use of existing knowledge. It involves using past experiences as heuristics to guide or 
accelerate current reasoning processes, as in using analogical inferences to recommend 
promising search paths. For example, in (Kling, 1971), analogy to a past problem 
suggests which clauses should be used in a new theorem proving task. Carbonell 
(1983a) uses traces of past reasoning steps to help guide search in planning tasks. 
Kedar-Cabelli (1985b) uses known examples of some concept to guide a proof that a 
new object is an instance of that concept. This classification also applies to most case- 
based reasoning systems, which draw on previous instances of similar situations for 
guidance (e.g. Kolodner et al., 1985; Bain, 1986). Here a strict definition of reasoning 
by analogy will be employed, in which analogy is used for guidance, and mapping and 
use combine to produce d-sound inferences. 

Analogical Learning. Analogical learning is concerned with the acquisition of new knowledge.
Given similarity between $P_b$ and $P_t$, assume the truth of $Q_t$, where the status
of $Q_t$ cannot be determined in the current deductive closure. Roughly, if a target
situation is encountered which is believed to be similar to some well understood base
situation, perhaps additional knowledge available about the base may also hold for
the target. Thus, analogical learning uses i-sound analogical inferences to posit new
knowledge about the target domain (e.g., Burstein, 1983; Greiner, 1988).

These uses of analogy may be intermixed. The results of analogical reasoning processes
may be stored, which is an analogy-independent process equivalent to analytical learning
(EBL). In addition, one may use the results of analogical learning to form a more general
concept description. For example, knowledge of the solar system could be used to learn 
about the Rutherford model of the atom. In turn, these could be used to form a general 
concept of central-force systems. 
This thesis, in combining interpretation and theory formation paradigms, is concerned
with both analogical reasoning and analogical learning. Similarity to experience is used
to suggest plausible explanations. If this represents an existing theory, then analogical
reasoning has been performed. If it represents a novel (i.e., at the knowledge level
(Dietterich, 1986)) theory, then analogical learning has been performed. However, most
emphasis is placed on learning, since it is the more difficult of the two and must work extra
hard to maximize the probability of correctness, a desirable trait in general.

From this taxonomy, we can see that the two primary functions of analogy are (1) 
establishing similarity and (2) sanctioning analogical inferences. While this is perhaps 
rather obvious, it is important to keep in mind. Few computational models of analogy 
explicitly provide both functions. 

2.2 Structure Mapping Theory 

The model of mapping used in this thesis is an adaptation of Gentner's (1980, 1983, 1988)
structure-mapping theory (SMT) of analogy. This section reviews Gentner's theory and
introduces some of the theoretical distinctions that have been drawn from it. Problems
encountered with the structure-mapping theory which helped to motivate the hybrid model 
used in this thesis are then discussed in the next section. 

Structure-mapping describes the set of implicit constraints by which people process
mappings of analogy and similarity. It is based on the intuition that analogies are about
relations, rather than simple features. No matter what kind of knowledge (causal models,
plans, stories, etc.), it is the structural properties (i.e., the interrelationships between the
facts) that determine the content of an analogy. The target objects do not have to resemble
their corresponding base objects, but are placed in correspondence due to corresponding
roles in the common relational structure.

Structure-mapping theory states that predicates are mapped from the base to the target
according to the following three mapping rules:

1. Discard isolated object descriptions (e.g., RED) unless they are involved in a larger 
relational structure. 

2. Try to preserve relations between objects. 

3. Use systematicity to determine which relations are mapped. Systematicity states
a preference for systems of relations as exhibited by the existence of higher-order
relations (e.g., CAUSE).

Figure 2.1: Two physical situations involving flow (adapted from Buckley, 1979).



This mapping must be 

• one-to-one: No base item maps to two target items and no target item maps to two
base items.

• structurally consistent: If relations B_i and T_j are placed in correspondence, then their
arguments must exhaustively correspond as well.

• identical: Only identical relations are allowed to match.*
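
The three requirements above can be read as checks on a proposed set of correspondences. The following is a minimal sketch, assuming expressions are encoded as nested tuples `(predicate, arg1, ...)` and entities as strings; this encoding, the function names, and the five-argument `Flow` relation are illustrative assumptions rather than the representation used by any actual matcher.

```python
def is_one_to_one(pairs):
    """No base item maps to two target items, and vice versa."""
    bases = [b for b, _ in pairs]
    targets = [t for _, t in pairs]
    return len(set(bases)) == len(bases) and len(set(targets)) == len(targets)


def is_structurally_consistent(pairs):
    """If two relations are paired, their arguments must be paired position by position."""
    pair_set = set(pairs)
    for b, t in pair_set:
        if isinstance(b, tuple) != isinstance(t, tuple):
            return False  # a relation may not be paired with a bare entity
        if isinstance(b, tuple):
            if len(b) != len(t):
                return False
            for b_arg, t_arg in zip(b[1:], t[1:]):
                if (b_arg, t_arg) not in pair_set:
                    return False
    return True


def has_identical_relations(pairs):
    """Paired relations must carry the same predicate name."""
    return all(b[0] == t[0] for b, t in pairs
               if isinstance(b, tuple) and isinstance(t, tuple))


def is_valid_smt_mapping(pairs):
    return (is_one_to_one(pairs)
            and is_structurally_consistent(pairs)
            and has_identical_relations(pairs))


# Hypothetical encoding of the water flow / heat flow correspondences.
flow_b = ("Flow", "beaker", "vial", "water", "pipe")
flow_t = ("Flow", "coffee", "ice-cube", "heat", "bar")
mapping = {
    (flow_b, flow_t),
    ("beaker", "coffee"), ("vial", "ice-cube"),
    ("water", "heat"), ("pipe", "bar"),
}
```

Here `is_valid_smt_mapping(mapping)` holds; adding the pair `("beaker", "ice-cube")` would violate the one-to-one requirement, and dropping `("pipe", "bar")` would violate structural consistency.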

Gentner's theory is important in that it provides rules for analogical mapping that
are based solely on the structural properties of domain descriptions, rather than on their
content. This provides a general mapping component applicable to a variety of analogy
tasks, including concept learning, problem solving, and many types of metaphor interpretation.
Furthermore, the systematicity principle captures the tacit preference for deep,
well-supported knowledge in analogy rather than shallow associations. Analogy implies
that a relational system from one situation may be applied to another, independent of
superficial similarity or dissimilarity.

For example, consider the water flow - heat-flow analogy shown in Figure 2.1 (from 
Forbus & Gentner, 1983). To process this analogy according to the rules of SMT, one 
must 

• Set up the object correspondences (e.g., beaker goes to coffee). 

*A revision to allow non-identical matching for functions appearing as arguments of identical relations
appears in (Gentner, 1988).

• Discard isolated object attributes (e.g., (Liquid water)).

• Map the water flow relations into their corresponding heat flow relations. 

• Observe systematicity by focusing on relations belonging to a systematic relational
structure.

2.3 Limitations of Structure-mapping Theory 

Gentner's model has many attractive features. However, several limitations have been
found in attempting to apply it to a complex learning task. First, structure-mapping
theory requires that all relations are mapped identically between base and target situations.
This requirement is important in that it establishes semantic correspondence between the
structures being matched. However, while examples of analogies satisfying the identicality
restriction are easily constructed, it is too strong to achieve generality.* As Burstein (1983)
has pointed out, it is difficult to maintain relational identicality when mapping between
physically realizable situations and purely abstract ones. For example, consider relating how
a box can "contain" things to the way a computer variable can "contain" a value. They share
similar properties, but generally are distinguished, as in INSIDE and INSIDE-VAR.

A second difficulty with SMT is the assertion that systematicity is the sole selection
criterion for deciding among possible interpretations during mapping. One problem
is that the largest common relational system may not have anything to do with
the intended goals of the analogy. If we are interested in learning about heat capacity
by analogy to container volume in the liquids domain, we don't want the potentially
larger relational match between heat flow and liquid flow to be the analogy. Much of
the selection process may be performed prior to mapping by examining the base knowledge
in the current problem solving context, as sanctioned by SMT and demonstrated by
(Greiner, 1986; Falkenhainer, 1986; Kedar-Cabelli, 1988). Thus, if we were interested in
learning about heat capacity, only knowledge about fluid capacity need be considered during
mapping. However, there are two factors that work to defeat systematicity as a sole
selection criterion: ambiguity and lack of knowledge.

Mapping may often be a useful guide when the exact relevance of particular knowledge 
to the analogy is a priori ambiguous. Alternatively, information irrelevant to the current 
goals yet about the goal-relevant knowledge may be crucial to disambiguate the match.

*Gentner recognizes this limitation and has proposed iterative rerepresentation of decomposable
predicates as one possible solution (Gentner, 1988).
For example, the fact that Pressure and Temperature are both intensive quantities (i.e.,
point measurements) may be crucial when considering whether Temperature should map
to Pressure or to Amount. However, if potentially irrelevant relations are allowed into the
base and target representations, it is possible for them to dominate, producing a "best"
match that doesn't include the inferential structure wanted in the first place. In such
situations, there is no way to force the desired candidate inference out of the mapping
procedure if there is no way to contextually guide it. This was empirically observed during
the development of PHINEAS.

The other factor, lack of knowledge, is unavoidable in learning situations. As the amount
of knowledge about the target decreases, the available systematic match diminishes. At
some point of ignorance, systematicity plays no decisive role. Hence, some other factor
must influence the selection, such as the relevance of inferences sanctioned. Finally, it seems
reasonable to assume that some relations are more salient than others. Requiring that these
relations are always of higher order than less salient relations imposes strong constraints on
the representation. Gentner (1987) indicates that analogy occurring in contexts other than
problem solving is a reason to leave contextual factors out of the mapping component. It
appears that contextual factors, when present, cannot be external to the matcher and are
perhaps better viewed as optional influences on the mapping component.

Other problems are more subtle, and are not limited to learning tasks. These problems
were found by using SME (Falkenhainer et al., 1986, 1987), a flexible analogical matching
system that may be configured to obey the rules of structure-mapping theory.* It appears
that purely structural, content-independent matching rules are insufficient to prevent
anomalous mappings. There are two types of anomalous mappings that arise: spurious
matches and structure rearrangement.

The spurious match problem may be understood by reconsidering the water flow - heat
flow analogy with the ice cube immersed in the coffee rather than attached to a metal bar.
Suppose the causal description of fluid flow states necessary conditions for the participatory
objects source (beaker), destination (vial), and path (pipe), with those describing the path
(pipe) being:

Physical-Obj(pipe) ∧ Fluid-Connection(pipe, beaker, vial) ∧ ¬Blocked(pipe)

If the only instances of any of these predicates in the given target description are
{ Physical-Obj(coffee), Physical-Obj(ice-cube), Physical-Obj(coffee-cup) }, a purely
structural match will map beaker to coffee and vial to ice-cube, as before, but also

*SME is described in Chapter 3. When configured for structure-mapping theory, I will refer to it as

Water Flow: { Dec[Press(beaker)], Inc[Press(vial)] } Meets { Const[Press(beaker)], Const[Press(vial)] }

Heat Flow: { Const[Temp(coffee)], Inc[Temp(ice-cube)] } Meets { Dec[Temp(coffee)], Const[Temp(ice-cube)] }

Figure 2.2: The structure rearrangement problem in behavioral descriptions of liquid flow
and "heat flow".



pipe to coffee-cup. Clearly, immersion or physical contact is the path in this heat flow 
scenario, not the coffee cup. This requires the use of knowledge about the structures being 
manipulated, rather than a purely structural match. However, this is not a problem unique 
to SMT. It arises when one attempts to draw conclusions based purely on pattern matching,
without first inspecting the results in light of a surrounding reasoning task. 

The structure rearrangement problem arises from seeking the maximal structural match. 
In any structure matching paradigm, it is necessary to allow substructures to match, which 
occasionally results in some of the higher-order structure being dropped in order to per- 
form the match. Sometimes, the best match can be achieved by ignoring higher-order 
constraints, moving pieces of structure around, and violating the intended semantics of the 
representation. For example, consider the two-state behavioral descriptions of water flow
and "heat flow" shown in Figure 2.2. The heat flow description has been altered to demonstrate
the anomaly: the coffee temperature is constant in the first state and decreasing
in the second state. When the rules of SMT are applied, every item matches except the
temporal Meets relationship. That is, Inc[Press(vial)] maps to Inc[Temp(ice-cube)]
while Dec[Press(beaker)] maps to Dec[Temp(coffee)]. All four temporally-scoped water
flow relations have a structural correspondent in the heat flow situation. Time has been
rearranged.

Not all structure rearrangement is bad. For example, if temporal ordering is irrelevant
to the causal structure of a story, then temporal ordering should not constrain matching.
However, rearranging many other knowledge structures is clearly inappropriate, such as
decomposed objects, temporal states, and theories. Without inspecting the content of the
structures being manipulated, there is no way to make this decision. From a psychological
perspective, some forms of structure rearrangement confusions are probably witnessed in
humans. Some rearrangements are clearly invalid, others are arguably non-optimal. The
key point is that if knowledge is available to spot these confusions, then it should be applied.

2.4 Contextual Structure-Mapping

Gentner's structure-mapping theory takes the position that only the structural properties of
the representations should be examined during mapping. In applying analogy to a complex
learning task, I have had to adapt this view in several important ways. Specifically, I propose
that information about the structures being manipulated should be used to maintain
consistency of the mapping, to reason about how items should be placed in correspondence,
and to influence selection among possibly many alternative mappings. I call this approach
contextual structure-mapping, since it uses knowledge about the representations being
manipulated and the context in which they are being used as an aid in the mapping process.
It is a knowledge-intensive adaptation of Gentner's theory.
Mapping is concerned with two important problems: 

• Correspondence Problem: What objects and relations may be placed in corre- 
spondence? 

• Selection Problem: What factors should decide how the "best" mapping is chosen?

These two problems distinguish between what is allowed to match and what the set of 
correspondences will be. They have many facets, as discussed below. The central difficulty 
of mapping is restricting the enormous space of possibilities to a small, plausible subset. 
Constraints on the mapping process which determine admissibility and selection gener- 
ally fall into three classes (Hall, to appear): structural constraints preserve the relational 
structure of the descriptions, semantic similarity restricts pairwise matching of predicates 
according to their similarity, and contextual relevance motivates the mapping towards
solutions relevant to the needs of the performance element. The constraints and influences
used in contextual structure-mapping are: 

1. One-to-one: SMT's one-to-one restriction is adopted in its exact form. The mapping 
must not assign the same base item to multiple target items nor any target item to 
multiple base items. 

2. Structurally grounded: If base predicate B_i is mapped onto target predicate T_j, then
all of B_i's arguments must also map onto the arguments of T_j. In the case of predicates
B_i and T_j forming sets of relations, called relational groups (e.g., AND, SET), structural
grounding is weakened to require that at least one of their arguments be paired.

3. Domain-specific: The mapping must not violate general representational or
domain-specific constraints that apply to pairing specific base and target descriptions.

4. Semantic similarity: Predicates are allowed to match if they are (1) identical, (2)
functionally analogous, or (3) have a common generalization. These are defined below. 

5. Loyal: If the mapping is elaborating an existing mapping, then it must respect the
correspondences of the existing mapping. 

6. Selection: Both systematicity and contextual relevance determine which relations are
mapped.

2.4.1 Structural Constraints

The first two constraints together enforce SMT's structural consistency requirement (see
also Kling, 1971; Winston, 1980; Rumelhart & Norman, 1981; Burstein, 1983; Carbonell, 1983a;
Indurkhya, 1987). However, the formulation deviates from the standard definition in one
important respect. The requirement that the mapping be structurally grounded does not cross the
boundaries of a relational group. A relational group is distinguished as an unordered collection
of relational structures that may be collectively referred to as a unit. They correspond
to the abstract notion of a "set" and are associated to predicates taking any number of
arguments. For example, a set of relations joined by the predicate AND defines a relational
group. Other examples include the axioms of a theory, a decomposable compound object,
or the relations holding over an interval of time. Intuitively, we would like to say that two
groups correspond without requiring that their contents are exhaustively mapped.

If base and target propositions each contain a group as an argument, the propositions
should not be prevented from matching if the groups' members cannot be exhaustively
paired. For example, the set of relations

    B: Implies[And(P1, P2, P3), P4]
    T: Implies[And(P1', P2'), P4']          (1)

should match better than the set of relations

    B: Implies[And(P1, P2, P3), P4]
    T: P1', P2', P4'                        (2)

The original model of structural consistency would score (1) and (2) equally, since the
Implies relations of (1) would not be allowed to match. This is a particularly important
consideration when matching sequential, state-based descriptions (e.g., the behavior of
a system through time). The set of relations describing a pair of states often do not 
exhaustively match or are of different cardinality. Yet, higher-order relations over states, 
such as temporal orderings, are vital and must appear in the mapping. 
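
The weakened grounding requirement for relational groups can be sketched as follows. This is an illustration under stated assumptions, not the matcher's actual code: expressions are nested tuples `(predicate, arg1, ...)`, `pairs` is a set of already accepted (base, target) correspondences, and the names `GROUP_PREDICATES`, `is_group`, `arguments_correspond`, and `is_structurally_grounded` are hypothetical.

```python
GROUP_PREDICATES = frozenset({"And", "Set"})


def is_group(expr, group_predicates):
    """True if expr is a relational group such as And(...) or Set(...)."""
    return isinstance(expr, tuple) and expr[0] in group_predicates


def arguments_correspond(b_arg, t_arg, pairs, group_predicates):
    """Groups need only one paired member; anything else must itself be paired."""
    if is_group(b_arg, group_predicates) and is_group(t_arg, group_predicates):
        return any((b, t) in pairs for b in b_arg[1:] for t in t_arg[1:])
    return (b_arg, t_arg) in pairs


def is_structurally_grounded(base, target, pairs, group_predicates=GROUP_PREDICATES):
    """Check whether base and target expressions may be placed in correspondence."""
    if len(base) != len(target):
        return False
    return all(arguments_correspond(b, t, pairs, group_predicates)
               for b, t in zip(base[1:], target[1:]))


# Example (1) from the text: the groups differ in cardinality.
base_1 = ("Implies", ("And", "P1", "P2", "P3"), "P4")
target_1 = ("Implies", ("And", "P1'", "P2'"), "P4'")
pairs_1 = {("P1", "P1'"), ("P2", "P2'"), ("P4", "P4'")}
```

With relational groups recognized, the two Implies relations of example (1) are grounded even though P3 has no correspondent. Passing `group_predicates=frozenset()` treats And as an ordinary predicate, reproducing the stricter behavior in which the Implies relations are not allowed to match.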

As discussed in the previous section, purely structural constraints are insufficient to 
prevent structure rearrangement. Just as a purely syntactic predicate calculus is able to 
recognize the mutual inconsistency of P(x) and ¬P(x) but not the mutual inconsistency
of Solid(x) and Gas(x), a purely structural account of mapping will fail to recognize
some inconsistencies in the mapping. Thus, additional constraints are supplied to capture 
important general representational or domain-specific knowledge about the structures being 
manipulated. 

At the current time, only very general, representational constraints are defined. These 
prevent structure rearrangement across relational groups, such as mixing and matching 
elements of compound objects, theories, or temporal states. For example, the following 
rule preserves temporal relationships: 

MH(B_i,T_i) ∧ Temporally-Scoped(B_i) ∧ Temporally-Scoped(T_i) ∧
MH(B_j,T_j) ∧ Temporally-Scoped(B_j) ∧ Temporally-Scoped(T_j) ∧
{ [EqualTime(B_i,B_j) ∧ DisjointTime(T_i,T_j)] ∨
  [DisjointTime(B_i,B_j) ∧ EqualTime(T_i,T_j)] }
⇒ Conflicting[MH(B_i,T_i), MH(B_j,T_j)]

where Conflicting indicates two mutually inconsistent pairings (see Section 3.2). 
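
A minimal sketch of this rule, applied to the altered heat flow description of Figure 2.2, is given below. It makes simplifying assumptions that are not claims about the actual implementation: each temporally scoped item is tagged with a single state label, EqualTime is taken to mean "same label" and DisjointTime "different label", and items without a label are treated as not temporally scoped. The function name `conflicting` and the labels W1, W2, H1, H2 are hypothetical.

```python
def conflicting(mh_1, mh_2, time_of):
    """True if two match hypotheses (base, target) rearrange temporal structure."""
    (b_i, t_i), (b_j, t_j) = mh_1, mh_2
    # The rule applies only when all four items are temporally scoped.
    if any(item not in time_of for item in (b_i, t_i, b_j, t_j)):
        return False
    base_equal = time_of[b_i] == time_of[b_j]
    target_equal = time_of[t_i] == time_of[t_j]
    # Conflict: equal times on one side but disjoint times on the other.
    return base_equal != target_equal


# State labels for the two-state descriptions (W = water flow, H = heat flow).
time_of = {
    "Dec[Press(beaker)]": "W1", "Inc[Press(vial)]": "W1",
    "Const[Press(beaker)]": "W2", "Const[Press(vial)]": "W2",
    "Const[Temp(coffee)]": "H1", "Inc[Temp(ice-cube)]": "H1",
    "Dec[Temp(coffee)]": "H2", "Const[Temp(ice-cube)]": "H2",
}

mh_dec = ("Dec[Press(beaker)]", "Dec[Temp(coffee)]")        # W1 -> H2
mh_inc = ("Inc[Press(vial)]", "Inc[Temp(ice-cube)]")        # W1 -> H1
mh_const = ("Const[Press(vial)]", "Const[Temp(ice-cube)]")  # W2 -> H2
```

Under these assumptions `mh_dec` and `mh_inc` conflict, since their base items share a state while their target items do not, which is exactly the pairing that rearranged time in the example. `mh_inc` and `mh_const` do not conflict.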

2.4.2 Semantic Similarity 

Semantic similarity is a crucial component of the correspondence problem, during both 
mapping and transfer. During mapping, it is important to limit the possibilities and ensure 
that a semantic correspondence is being made. During transfer, it is important to guide 
search for correspondents potentially absent from the given target description and to guide 
creation of a new predicate if one is needed. Semantic similarity enables the adaptation 
of knowledge to analogous situations without requiring that the knowledge be expressed 
exactly the same in all situations. 

Most accounts of analogy enforce semantic similarity by testing for predicate identicality
(e.g., Carbonell, 1981; Gentner, 1983). As noted earlier, identicality is too restrictive to
be of use for many reasoning situations. In across-domain analogies, it requires that the
important relational structure has been properly decomposed, as in Hotter-than(x, y) being
represented as Greater-than[temperature(x), temperature(y)]. Additionally, there
are many near or within domain relationships that are difficult, if not impossible, to reduce
to identicality. The concepts of objects Inside-of containers and solids Dissolved-in
liquids are very different (Burstein, 1985). Yet, it is clear these relations are candidates for
analogical mapping.

In models that allow non-identical relations to match, the preferred solution is to consult
a hierarchy of predicate types and allow relations with a common ancestor to match
(Winston, 1980; Burstein, 1983; Wellsch & Jones, 1986). The match score may then be
inversely proportional to how distant the relations are in an ISA hierarchy (Winston, 1980;
Wellsch & Jones, 1986). Take for example Burstein's CARL program. When a base relation
is carried into the target domain, CARL moves up an ISA hierarchy seeking a relation with
argument type restrictions loose enough that the target objects satisfy those constraints.
CARL applies the same process to action predicates (e.g., Trans is a parent action of Ptrans
and Ntrans).

However, in solving the identicality problem this approach raises a number of additional 
problems. There is the potential to go too high in the network (e.g., to Relation), and 
ultimately place every relation in correspondence with every other relation. This may be 
prevented by using forests of hierarchies, ensuring that semantically close relations are 
mutually reachable, while ensuring that each tree doesn't go too high so as to achieve
meaningless generality. Yet, this presents a dilemma. Besides seeming rather ad hoc,
it places a strong burden on the user to a priori know which relations should belong to 
common clusters and which should not. Some approaches adopt a halfway point, by forming 
general classes of predicates such as action, relation, plan, goal, etc. and restricting mapping 
within these boundaries (Burstein, 1985). The generalization tree models further suffer 
from being single parent hierarchies. This is easily fixed using a more sophisticated ISA 
lattice, but then we must decide which parental branch to follow and what the consequence 
of multiple intersections will be. 

These approaches fail to recognize an important point: mappability is context sensitive.
When a predicate instance is used in some context (e.g., in a chain of inference), it is used
because it denotes certain characteristics about the world deemed important for that context.
Thus, propositions should be semantically similar with respect to their functional role
in the surrounding structure. Movement within the generalization hierarchy is dependent
upon the characteristics the predicate was intended to possess. When moving up the
hierarchy, a point may be reached where those properties are no longer present. Furthermore,
this implies that mappable relations need not always share a common ancestor (except
perhaps at the uppermost relation node).




There are numerous examples of contextually sensitive similarity. The property Cylindrical
of one cup should map to the property Has-Handle of another cup if the role of these
expressions is to support liftability. As another example, whether or not we use Contained-in,
Dissolved-in, or Absorbed-in depends upon the surrounding context. Each has a privileged
set of inferences. You can squeeze out liquid absorbed by a solid, but you can't
squeeze salt out of the water it has dissolved in (at least not in the same way).

Therefore, if two predicates are not identical, they may still be considered semantically 
similar and eligible for mapping if they are functionally analogous: 

Definition 2.2 (Functionally analogous) Two expressions are considered functionally
analogous and may match if they provide the same inferential support in the context of the 
structures being matched. 

There are several ways to determine the inferential support an expression is providing.
When an expression's role is explicit in the structure, it is particularly simple. For example,
expressions P and R will be placed in correspondence when matching Implies(P,Q)
with Implies(R,Q), since their respective roles are to deductively support Q. Being
structurally explicit is the only method for determining whether two expressions are functionally
analogous during mapping.

Another method addresses the problem of compiled knowledge which is absent from
the explicit structure. This method may be used during transfer when seeking information
about unmatched expressions. AI systems tend to use compiled knowledge, in which
intermediate reasoning steps are absent to promote efficiency of use. Indeed, this is the central
goal of explanation-based generalization (Mitchell et al., 1986; DeJong & Mooney, 1986).
In PHINEAS, this is a common occurrence, owing to its use of QP theory syntax. A process
definition consists of a set of antecedents indicating when the process will be active
and a set of effects which hold when the process is active. While effective for reasoning,
a great deal of information is compiled away by the model builder. No direct links between
antecedents and effects are available. The actual "theory" about the domain being
represented, such as why each antecedent is present in the process description, is absent.
To address this problem, PHINEAS uses an augmented process description. A cache slot is
added to link necessary prerequisites of the effects relations to the antecedents that satisfy
those prerequisites. For example the effect relation

Ctrans(source-stuff, destination-stuff, rate)

indicates a continuous transfer of some "stuff" (e.g., fluid or energy) from a source to a
destination. By continuity of motion, a necessary prerequisite for Ctrans is the existence
of some physical path between the source and destination. Therefore, this prerequisite
information, and how it is satisfied, should be present in the cache slot of any process that
uses Ctrans. In the dissolving process, this appears as:

(Satisfies (Immersed-in ?solute ?solution)
           (Prereq (Ctrans (amount-of ?solute) (concentration ?solution) ?rate)
                   (Physical-Path ?solute ?solution ?path)
                   (Motion-Continuity)))

When investigating the role of Immersed-in, a post-mapping process may consult the
dissolving process' cache slot to determine that a relation not supporting Physical-Path (or
the analogue of Physical-Path if this is part of the mapping) cannot be used as an analogue
for Immersed-in. Importantly, this information indicates that it is the physical connection
aspect of Immersed-in that is important (e.g., as opposed to preventing exposure to the
atmosphere). This will be described more during the discussion on transfer (Chapter 5).

The definition of functionally analogous is a general statement of the specific problem
derivational analogy (Carbonell, 1983a) attempted to address for planning situations.
Specifically, a fundamental component of analogy is knowing why the relations being
considered for mapping are there. A good analogizer is able to recognize alternate ways to
achieve equivalent functionality. In problem solving, adapting a prior solution to a new
problem instance requires just that, adaptation of the prior solution. This task is greatly
simplified if we know why decisions were made the way they were, so that we can satisfy
the intent of the decision without necessarily adhering to the same decision.

In the absence of such background knowledge, proximity within a generalization
hierarchy is a good heuristic to use. A portion of the hierarchy used in PHINEAS appears
in Figure 2.3 (see Appendix E for the complete set). There is a good chance that the
characteristics desired of a relation will also be present in its nearest neighbors in the
generalization hierarchy. The farther the relations are in the hierarchy, the less likely this will
be true. Thus, the strength of this type of match should diminish in proportion to the
distance within the generalization hierarchy between the predicates matched. I call this
the minimal ascension principle.* Two constraints are used to limit improper generalizations.
First, only structurally motivated pairings are made. Specifically, two predicates
may only match via minimal ascension if their corresponding parent relations have already
been paired. Second, the ISA hierarchy is assumed to be shallow and highly disconnected.
Thus, the hierarchy in Figure 2.3 indicates that minimal ascension is not allowed to match
a quantity to a type of physical proximity.
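
The scoring side of minimal ascension can be sketched as follows. This is an illustration only: the small hierarchy is a hypothetical stand-in for Figure 2.3, the single-parent dictionary encoding and the names `ancestors`, `ascension_distance`, and `match_score` are assumptions, and the particular formula 1 / (1 + distance) is just one simple way to make strength fall off with distance. The structural precondition (parent relations already paired) is not modeled here.

```python
def ancestors(pred, parent):
    """Chain from pred up to the root of its tree (assumes an acyclic hierarchy)."""
    chain = [pred]
    while chain[-1] in parent:
        chain.append(parent[chain[-1]])
    return chain


def ascension_distance(a, b, parent):
    """Edges climbed from a and b to their nearest common ancestor, or None."""
    up_a = ancestors(a, parent)
    up_b = ancestors(b, parent)
    for steps_a, node in enumerate(up_a):
        if node in up_b:
            return steps_a + up_b.index(node)
    return None  # different trees of the forest: no match via ascension


def match_score(a, b, parent):
    """Score in [0, 1] that shrinks as the predicates get farther apart."""
    distance = ascension_distance(a, b, parent)
    if distance is None:
        return 0.0
    return 1.0 / (1 + distance)


# Hypothetical shallow, disconnected ISA forest (child -> parent).
isa_parent = {
    "Contained-in": "Physical-Proximity",
    "Immersed-in": "Physical-Proximity",
    "Pressure": "Quantity",
    "Temperature": "Quantity",
}
```

In this sketch identical predicates score 1.0, siblings such as Contained-in and Immersed-in score lower, and a quantity never matches a physical proximity relation because the two trees share no ancestor.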

*Term suggested by Dedre Gentner. The same type of principle has long been used in structured
induction, and appears as the climbing the generalization tree rule in (Michalski, 1983).

Figure 2.3: A portion of the predicate ISA hierarchy used in PHINEAS.



2.4.3 Selection: Weighing systematicity and relevance 

• Selection Problem: What factors should decide how the "best" mapping is chosen?

The importance of context and problem solving goals in analogical processing is a
recurring theme (Burstein, 1983; Carbonell, 1983a; Greiner, 1988; Holyoak, 1984;
Kedar-Cabelli, 1985b). It is generally agreed that in problem solving situations, the current
reasoning goals have a strong influence over what base information is retrieved and how the
results of analogy are ultimately evaluated and used. Debate centers around contextual effects
on mapping. At one extreme, Gentner (1988) proposes that problem solving influences the
processes preceding and following mapping, but plays no role in the mapping process, which
is guided solely by systematicity. However, from the discussion of Section 2.3, it appears
that this is insufficient to guide the mapping process. At the other extreme, Holyoak (1984)
maintains that problem solving relevance alone drives the entire mapping process. This
model is only applicable to analogical problem solving and ignores other uses of analogy.
Other approaches are more agnostic on the issue, since neither systematicity nor goal related
relevance appears explicitly in the mapping process. Relevant base information is selected
during access, with mapping consisting of its reinstantiation and potential adaptation for
the target case (Burstein, 1983; Carbonell, 1983a; Kedar-Cabelli, 1985b; Greiner, 1988).^

^Greiner (1988) does consider a host of alternative interpretations and could be said to be following
a form of systematicity. During target instantiation of a base abstraction, Greiner's fewest conjectures
Since an explicit match is never formed and mapping is heavily constrained (i.e., through
teacher-supplied hints or completely unambiguous mappings), choosing among alternative
interpretations is avoided.

From pragmatic theories, as well as problems experienced with a purely structural
model, we can see that relevance can be an important influence on analogical mapping.
However, people can evaluate analogies in context-free settings (Gentner & Landers, 1985;
Rattermann & Gentner, 1987; Clement & Gentner, 1988), supporting the view that
systematicity plays a significant role in people's selection criteria. People are able to process
an analogy without a goal in mind. They are able to "see" similarity without necessarily
being in the middle of solving some problem. From a purely computational perspective,
mapping should be able to spot similarity in the absence of goals, since the various uses of
analogy and similarity should be achieved by a single computational model of mapping.
However, it should also be able to adapt to the needs of a surrounding problem solving
context if one exists.

Thus, a hybrid approach, influenced by both systematicity and contextual relevance, is
used. In the absence of problem solving goals, systematicity serves as the sole criterion for
selection. When problem solving goals are present, interpretations containing the relevant
base information are preferred.* An important feature is that contextual information is not
required; its presence serves to influence the normal operation of mapping, not replace it.

2.4.4 Candidate Inferences and the Validity Problem

One of the important functions of mapping is to identify base information plausibly inferable 
for the target situation. This requires consideration of validity, the central problem in using 
analogically derived knowledge. 

• Validity Problem: What is the basis for having confidence in the analogically
proposed inferences?

The desire for validity affects each stage of analogical processing. Relations that are
predictably useful are sought during access. Mapping should only propose plausible inferences
that follow from the set of known correspondences. Finally, analogy is an approximate
process and there is a tradeoff between generality of the process design and guaranteed
correctness of what it produces. Analogical reasoning systems, by definition, only retain

heuristic (h_fc) prefers analogies that require adding the fewest new conjectures (i.e., candidate inferences).
This is equivalent to preferring the maximal match to the relational system called the base abstraction.

^Recently, Holyoak and Thagard (1988a) have independently proposed a similar hybrid model of
combined systematicity and pragmatic influence.




those inferences or procedures that were a priori derivable from the target domain theory 
(e.g., Carbonell, 1983a; Kedar-Cabelli, 1985b). Analogy in this context is used to improve
performance. Analogical learning systems must be more cautious and must be able to 
revise their beliefs when inferences are shown to be invalid. 

It is important to identify methods which will increase the likelihood that the analogy 
process produces only valid inferences. All efforts to maximize validity may be divided into 
four general categories: 

1. Representation. Develop specialized knowledge structures such that only certain valid 
(or highly likely) analogies are allowed (e.g., the determinations of Davies & Russell 
(1987)). 

2. Access. Constrain the access mechanism so that only highly likely analogues will be 
retrieved (e.g., Kedar-Cabelli's (1985) purpose-directed mechanism). 

3. Inference production. Use mapping procedures which focus on validity preserving
features (e.g., Gentner's (1983) systematicity principle).

4. Post-mapping. Analyze the established analogy to ensure maximum coherency (e.g., 
Clement's (1986) bridging analogies). 

Representational approaches have the nice property that they make the basis for drawing
analogical inferences explicit. For example, Russell and Davies (Russell, 1987; Davies
& Russell, 1987) have recently proposed the use of determinations to ensure validity in
analogical inference.* Determinations are specifications of functional dependence. The
determination P ≻ Q indicates that P functionally determines the value of Q, since there
is a unique value for Q given P. This declares a relationship between P and Q that is too
weak to enable conclusions on its own, but enables a valid conclusion about a target case
once a base instance has been encountered.

However, there are a number of problems with this approach. First, it defines away the
majority of what is intuitively called analogical. For example, given that f(x, y) = z, and
a base situation showing f(4, 9) = 7, 7 may be recalled when a new problem situation asks
for the value of f(4, 9). Few analogies can be described in these terms. Second, it doesn't
require the presence of the base description during the analogical act. The target query
is derivable from the domain theory once the base is given, eliminating interplay between
base and target to influence inferencing. Analogy involves comparison, measuring degrees
of similarity, not identicality over atomic features in a functional dependency.
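The f(4, 9) = 7 example can be made concrete with a short Python sketch. The `Determination` class and its method names are hypothetical and are not taken from Davies and Russell's formulation; the sketch only shows why inference under a determination reduces to recalling a stored case with identical determining values.

```python
from typing import Dict, Hashable, Optional, Tuple


class Determination:
    """Toy model of a determination P > Q: a tuple of determining values (P)
    fixes a unique determined value (Q). Illustrative only."""

    def __init__(self) -> None:
        self._cases: Dict[Tuple[Hashable, ...], Hashable] = {}

    def observe(self, p_values: Tuple[Hashable, ...], q_value: Hashable) -> None:
        """Store a base case; a second, different Q for the same P would
        contradict the functional dependence, so it is rejected."""
        if p_values in self._cases and self._cases[p_values] != q_value:
            raise ValueError("base cases violate the determination")
        self._cases[p_values] = q_value

    def infer(self, p_values: Tuple[Hashable, ...]) -> Optional[Hashable]:
        """Answer a target query only if an identical base case is known."""
        return self._cases.get(p_values)


f = Determination()
f.observe((4, 9), 7)          # base situation: f(4, 9) = 7
answer_same = f.infer((4, 9))     # identical determining values
answer_near = f.infer((4, 10))    # merely similar values give no answer
```

Here `answer_same` is 7 and `answer_near` is `None`: the lookup has no notion of degrees of similarity, which is the limitation the paragraph above points out.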

*See (Baker et al., 1988; Clark, 1988) for other approaches to the representation problem. 
Structural approaches attempt a more content-independent approach:

Arguments from models involve those analogies which can be used to predict the
occurrence of certain properties or events, and hence the relevant relations are causal,
at least in the sense of implying a tendency to co-occur. (Hesse, 1966, pg. 78)

Hesse argues, as does Gentner (1980, 1983), that a necessary (but not sufficient) condition
for validity is that analogy involve the mapping of facts which are "causally" related, rather
than miscellaneous features. Thus, only if the set of facts P_b are known to cause (or have
a tendency to co-occur with) the set Q_b do we have any basis for believing that Q_t will
also hold given P_t. Gentner generalizes this criterion by using the framework of "systematic
structure" rather than "causality". This creates a general definition which focuses on
interrelated facts and views this interrelation as supporting predictivity. Thus, a match
over some facts sanctions inferences for remaining facts in the larger relational system they
appear in. For example, consider an example from (Davies & Russell, 1987): A red robin
is observed and found to have long legs and a scratched beak. Since we would possess
a relational system relating a robin's various body proportions to its being a robin, but
no such system relating a scratched beak to its being a robin, only the former would be
postulated for a second robin.

Russell and Davies correctly observe that analogy research has not directly addressed
the validity issue. In the case of structural approaches, structure can easily be added to
any description and not all structure supports inferencing. Since SMT does not allow one
to examine the structures being manipulated, all matches and resulting inferences look the
same, given equal structure topology. For example, I could add a description of how the first
red robin and the scratch on its beak co-occur in time. From a purely structural vantage
point, there is now no reason not to infer that a second robin should have a scratched
beak. Thus, it would appear that sharing common relational structure is not sufficient to
constrain inference production to producing inferences that have some grounds for validity.

More work is needed on representational and structural approaches. In this thesis, the
central basis for most of the inferences drawn rests on the belief that similar behavior
indicates a strong likelihood of similar causes. Since PHINEAS is aimed at this one type
of explanation, this assumption is built in rather than explicitly available to the program.
This work focuses on post-mapping methods to increase confidence in proposed analogies
(i.e., verification-based analogical learning).

• Consistency Restrictiveness: Two-valued consistency is overly restrictive as a
basis for analogical processing.

One dominant point of convergence in analogy research has been the centrality of
consistency in guiding and evaluating analogy production (Indurkhya, 1987; Greiner, 1988;
Kedar-Cabelli, 1985a; Hall, to appear). Many go beyond consistency to require that
analogy produces d-sound analogical inferences (e.g., Carbonell, 1983a; Kedar-Cabelli, 1985b;
Davies & Russell, 1987).

Consistency is an important component of all forms of reasoning. Yet, we must be
careful not to afford it too much import. Limiting analogy to strict consistency requires
that analogy be a monotonic process. However, analogy often causes the questioning of
beliefs and may lead to a complete change in world view. Thus, a weaker form of consistency
is needed, one which weighs the cost of overthrowing or revising prior beliefs against
the benefits of a more coherent belief state. Thagard's (1988) work on explanatory
coherence may be viewed as a step in that direction.

While this thesis does not offer a general solution to this problem, it is important to 
keep in mind so as to avoid theories that crucially depend upon strict consistency. 

Additional Correspondence Subproblems 

There are two identifiable subproblems to the correspondence problem not yet discussed. 
The first has several instantiations: 

• Non-Monotonic Binding Problem: Define a binding to be any pairwise
correspondence between atomic units. Thus, a binding may be an analogical
correspondence between two objects, or the binding of a variable during unification.
The non-monotonic (N-M) binding problem occurs when an influx of new
information is allowed to overturn an existing binding set, either due to internal
inconsistency or in favor of a superior binding.

The following two observations are partly responsible for the N-M binding problem: 

1. Not all candidate inferences are real inferences. In realistic memories, it is unlikely
that an analogizer will be operating on every item which comprises the representation
of the base or target domains. Instead, a subset of the base or target descriptions is
used. Since a candidate inference is with respect to the subset of the base or target
being processed, it might not be an inference at all if a different aspect of the target
were fetched. Alternatively, a candidate inference may represent a place where the
analogizer failed to detect semantic similarity with an available target description. If
there is more than one way to say something, retrieving a different representation may
make the similarity more discernible. Thus, the idea that base and target knowledge
can be fully prepackaged for the mapping component breaks down in general. There
may be a need to reprobe memory to seek further information.
2. Not all matches are real matches. Suppose the base description contains R_1(b_1),
R_2(b_1), and R_3(b_1). If the target description has no instances of R_1 or R_2, and the
only instance of R_3 is R_3(t_1), then, if consistent, b_1 will be placed in correspondence
with t_1. This isn't necessarily a good match to make. Other target objects may exist
that provide a better fit, but were not mentioned in the original target description.
Alternatively, conjecturing the existence of an unknown target object may be
preferable. What if R_1(t_1), a possible resulting inference, is known to be false? Rather than
reject the analogy, it may be preferable to question if there were sufficient grounds to
conjecture t_1 as the correspondent of b_1.

The N-M binding problem appears as the spurious match problem mentioned in
Section 2.3 and discussed above as "not all matches are real matches". It isn't limited to
weak matches, however. It may occur due to ambiguity over which of two good matches to
choose. It may also occur when a previously unnoticed target item is detected and found
to be a better correspondent for some base item than its current target correspondent. For
example, consider the analogy between a spring-block oscillator and an LC circuit. In the
spring-block system, position is easily measurable while force is not. In the LC circuit,
voltage is easily measurable while charge is not. Thus, an analogy focusing on observables
would place the block's position quantity in correspondence with the circuit's voltage
quantity. However, a more thorough analysis of the two systems would show that force should
map to voltage while position should map to charge.

Second, it affects variable bindings during abductive inference. Abductive inference
is required when a set of unmatched base objects is carried into the target and target
correspondents sought. Suppose the goal is to seek an object satisfying the conjunction

P(?x) ∧ Q(?x) ∧ R(?x)

Traditional, sequential backchaining on each conjunctive subgoal fails. Suppose there are
two objects, a and b, where P(a) is the only knowledge about a, and Q(b) ∧ R(b) is the
only knowledge about b. Sequential subgoaling will be unable to propose b as the best
binding for ?x, since the candidate binding set is {a} after showing P(a). Seeking a set
of unknown base objects can compound the problem. Due to potential interdependencies
between the unknown objects, what may be a best match for one unknown may not be
best for matching the other unknowns.
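The contrast between sequential subgoaling and a global view of the conjunction can be sketched in Python. This is a simplified model written for illustration, not code from PHINEAS: facts are (predicate, object) pairs, an unmet subgoal is treated as something to be conjectured, and the function names are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Fact = Tuple[str, str]  # (predicate, object)


def sequential_candidates(goals: List[str], facts: Set[Fact],
                          objects: List[str]) -> List[str]:
    """Solve subgoals left to right. Once a subgoal narrows the candidate
    set, later subgoals only ever look at the surviving candidates."""
    candidates = list(objects)
    for pred in goals:
        satisfied = [obj for obj in candidates if (pred, obj) in facts]
        if satisfied:
            candidates = satisfied
        # Otherwise the subgoal would have to be conjectured for the
        # current candidates; the candidate set is not reopened.
    return candidates


def global_best(goals: List[str], facts: Set[Fact],
                objects: List[str]) -> Tuple[List[str], Dict[str, int]]:
    """Score every object against the whole conjunction at once and return
    the objects satisfying the most conjuncts, plus all scores."""
    scores = {obj: sum((pred, obj) in facts for pred in goals) for obj in objects}
    top = max(scores.values())
    return [obj for obj in objects if scores[obj] == top], scores


goals = ["P", "Q", "R"]
facts = {("P", "a"), ("Q", "b"), ("R", "b")}
objects = ["a", "b"]

sequential_result = sequential_candidates(goals, facts, objects)
best_objects, scores = global_best(goals, facts, objects)
```

With the facts from the example, `sequential_result` is `["a"]` because the binding set is fixed after P(a), whereas the global scoring prefers `b`, which satisfies two of the three conjuncts.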

The N-M binding problem has struck others before me. It is the implicit motivation
behind Kedar-Cabelli's (1988, pg. 131-132) near miss assumption. She mentions that her
method of selecting the first available analogue and working to adapt it has the potential
to miss the best set of correspondences. This appears to be an instance of the N-M binding
problem.

I claim that approaches attempting to solve a series of conjuncts, rather than seeking
global similarity, will suffer from the N-M binding problem. A further implication of the 
problem is that one cannot always assume that an existing mapping may be consistently 
extended. This conflicts with the common view that the role of mapping is to consistently 
extend an existing mapping (e.g., as in extending a partial mapping produced during access) 
(Burstein, 1985; Kedar-Cabelli, 1985a; Hall, to appear). 

The problem is compounded by the use of packaged descriptions, in which two
bodies of knowledge are separated and designated "base" and "target" (e.g., Winston, 1980;
Gentner, 1983; Wellsch & Jones, 1986). This is often fine for the base description, which
is typically very familiar and for which relevant information is easily selected. However, it
is often difficult to know everything that is needed for mapping until mapping is attempted, at
which point mapping may spawn further probes into memory aimed at holes in the match.
This entire issue is discussed further in Chapters ? and 5.

• Reformulation Problem: What operations on the structures being examined are 
allowable? 

This has two related facets. First, there are multiple ways to represent equivalent
information. Matching two representations may thus require reformulating the descriptions
in attempts to recognize identicality or similarity (e.g., greater-than and less-than, to
cite a particularly simple case). Second, being a good analogizer means being able to
recognize alternative ways of doing things that still satisfy the intent of the analogy. For
example, recognizing an alternate way to satisfy the roles in a story or function in a plan.

For what share of this problem should mapping be responsible? Clearly, reformulations
arising from problem solving impasses must occur during the use phase (e.g., as in
derivational analogy (Carbonell, 1983a)). However, what about minor deviations in situation
or problem specification that are known prior to problem solving execution? Should the
mapping component share responsibility with the use phase for adapting to these minor
changes? It is important to know as soon as possible just how well and in what way two
potential analogues are actually analogous. This is important to evaluating a potential
analogue - an unnecessarily poor match may result in the rejection of a highly useful
analogue. It is also important to getting the proper inferences and thus the proper search path.
Furthermore, non problem solving settings lack a performance element to sort through the
results of a proposed analogy. The results of mapping are the desired end result. Thus,
the information acquirable during mapping should be maximized without losing the ability
to perform quick, inexpensive mapping.
Chapter 3

The Structure Mapping Engine 



The ability to detect similarity by identifying a set of correspondences between descriptions
of two situations is a prerequisite to general analogical processing. Furthermore, in PHINEAS
the same matching program is used in both the access and mapping procedures. For these
reasons, I will discuss the matching program used in PHINEAS first.

The Structure-Mapping Engine (SME) is a general tool for performing various types of
analogical matchings. SME was developed in collaboration with Ken Forbus and Dedre
Gentner to simulate Gentner's Structure-Mapping theory of analogy.^ The intent was to
develop a single program that could model all of the similarity comparisons sanctioned by
Gentner's theory, such as literal similarity, mere appearance, as well as analogy. The only
constraint built into the program is that the mapping preserve structural consistency. All
other constraints and all evaluation criteria are supplied in the form of match rules that
specify the matcher's operation. Thus, while SME was originally designed to simulate the
comparisons of structure-mapping theory, it can simulate the space of theories consistent
with this single criterion as well. Even so, many of the theoretical distinctions embodied
in SME have their origins in Gentner's Structure-Mapping theory.

Given descriptions of a base and target, SME constructs all consistent mappings between
them. Each mapping consists of pairwise matches between statements and entities in
the base and target, the set of analogical inferences sanctioned by the mapping, and an
evaluation score for the mapping. For example, suppose SME were given descriptions of
the situations shown in Figure 2.1: water flowing from a beaker to a vial and heat flowing
from hot coffee to an ice cube (described in Figure 3.1). SME might offer several alternative
analogical mappings. In one, the central inference would be that water flowing between

*Furthermore, since portions of this chapter are taken from (Falkenhainer, Forbus, & Gentner, 1987),
some of the writing credit for this chapter must go to my coauthors.
[Figure: WATER-FLOW and HEAT-FLOW panels]

Figure 3.1: Simplified water flow and heat flow descriptions.



the containers corresponds to heat flowing from the coffee to the ice cube. Alternatively, it 
might map water to coffee, since they are both liquids. Which interpretation has a higher 
evaluation score depends on the match rules in use. 

This chapter first summarizes the SME algorithm^ and describes the contextual
structure mapping configuration (i.e., a new set of match rules) along with additional features
that were left out of the earlier paper. Finally, SME is analyzed from both analytical and
empirical perspectives. The general program will be called SME, the program running the
rules of Gentner's Structure-Mapping theory will be called SME_SMT, and the program
running the rules of contextual structure mapping will be called SME_CSM. I start by reviewing
SME's conventions for knowledge representation, which are essential to understanding the
algorithm.



3.1 Representation Overview 

A typed (higher-order, in the standard sense) predicate calculus is used to represent facts.
The constructs of this language are entities, predicates, and dgroups:

Entities: Entities are logical individuals, i.e., the objects and constants of a domain.
Typical entities include physical objects, their temperature, and the substance they are
made of.



Predicates: The term "predicate" refers to any functor in a predicate calculus statement.
Predicates declared to SME may be divided into primitive categories. In this work there


*For details see (Falkenhainer et al., 1987).
are only two: relation and function (Falkenhainer et al., 1987 also discusses an attribute
category).

All predicates must be declared to SME prior to use (the declarations for PHINEAS are
listed in Appendix E). Each declaration defines the predicate's arity, a name and type 
(sort) for each argument, and the next most general type (sort) the predicate maps to. For 
example, the declaration: 

(defPredicate PRESSURE ((obj physob)) function
              :expression-type 'intensive-quantity)

states that the predicate PRESSURE is a one-place function. Its argument is called obj and
is of type physob. An expression using it, such as PRESSURE(water1), maps an expression
of type physob, water1, to an expression of type pressure, which in turn is of type
intensive-quantity, the next node up in the ISA hierarchy.

Predicates may additionally be declared commutative, in which the order of arguments
is unimportant when matching, and/or n-ary, in which the predicate can take any number
of arguments. Examples of commutative n-ary predicates include AND, OR, and SET.

Dgroup: For simplicity, predicate instances and compound terms are called
expressions. A description group, or dgroup, is a collection of entities and expressions
concerning them, considered as a unit. The expressions and entities in a dgroup will be
referred to collectively as items.

Dgroups are defined with the defDescription form:

(defDescription ⟨DescriptionName⟩
    entities (⟨Entity1⟩, ⟨Entity2⟩, ..., ⟨Entityi⟩)
    expressions (⟨ExpressionDeclarations⟩))

where ⟨ExpressionDeclarations⟩ take the form

    ⟨expression⟩ or

    (⟨expression⟩ :name ⟨ExpressionName⟩)

The :name option is provided for convenience; ⟨expression⟩ will be substituted for every
occurrence of ⟨ExpressionName⟩ in the dgroup's expressions when the dgroup is created.
For example, the description of water flow depicted in Figure 3.1 was given to SME as
(defDescription simple-water-flow
    entities (water beaker vial pipe)
    expressions (((flow beaker vial water pipe) :name wflow)
                 ((pressure beaker) :name pressure-beaker)
                 ((pressure vial) :name pressure-vial)
                 ((greater pressure-beaker pressure-vial) :name >pressure)
                 ((greater (diameter beaker) (diameter vial)) :name >diameter)
                 ((cause >pressure wflow) :name cause-flow)
                 (flat-top water)
                 (liquid water)))

The description of heat flow depicted in Figure 3.1 was given to SME as

(defDescription simple-heat-flow
    entities (coffee ice-cube bar heat)
    expressions (((flow coffee ice-cube heat bar) :name hflow)
                 ((temperature coffee) :name temp-coffee)
                 ((temperature ice-cube) :name temp-ice-cube)
                 ((greater temp-coffee temp-ice-cube) :name >temperature)
                 (flat-top coffee)
                 (liquid coffee)))
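The effect of the :name option can be illustrated outside of SME with a small Python sketch. It is not SME code: expressions are modeled as nested tuples, and `make_dgroup` and `expand` are hypothetical helpers that substitute each named expression wherever its name occurs, as the text describes.

```python
from typing import Dict, List, Tuple, Union

Expr = Union[str, tuple]  # an entity or name, or (functor, arg, ...)


def expand(expr: Expr, names: Dict[str, Expr]) -> Expr:
    """Replace every occurrence of an expression name by its expression."""
    if isinstance(expr, str):
        return expand(names[expr], names) if expr in names else expr
    return tuple(expand(part, names) for part in expr)


def make_dgroup(name: str, entities: List[str], declarations: List[tuple]) -> dict:
    """Build a dgroup. Each declaration is either a bare expression or a
    triple (expression, ":name", label), mirroring the two forms above."""
    names: Dict[str, Expr] = {}
    raw: List[Expr] = []
    for decl in declarations:
        if len(decl) == 3 and decl[1] == ":name":
            names[decl[2]] = decl[0]
            raw.append(decl[0])
        else:
            raw.append(decl)
    return {
        "name": name,
        "entities": list(entities),
        "expressions": [expand(e, names) for e in raw],
    }


water_flow = make_dgroup(
    "simple-water-flow",
    ["water", "beaker", "vial", "pipe"],
    [
        (("flow", "beaker", "vial", "water", "pipe"), ":name", "wflow"),
        (("pressure", "beaker"), ":name", "pressure-beaker"),
        (("pressure", "vial"), ":name", "pressure-vial"),
        (("greater", "pressure-beaker", "pressure-vial"), ":name", ">pressure"),
        (("greater", ("diameter", "beaker"), ("diameter", "vial")), ":name", ">diameter"),
        (("cause", ">pressure", "wflow"), ":name", "cause-flow"),
        ("flat-top", "water"),
        ("liquid", "water"),
    ],
)
```

After expansion, the `cause` expression contains the full `greater` and `flow` expressions rather than the labels `>pressure` and `wflow`.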

3.2 SME Algorithm Overview

Given descriptions of a base and a target, represented as dgroups, SME builds all structurally
consistent interpretations of the comparison between them. Each interpretation of the
match is called a global mapping, or gmap. Gmaps consist of three parts:

1. Correspondences: A set of pairwise matches between the expressions and entities of
the two dgroups.

2. Candidate Inferences: A set of new expressions which the comparison suggests hold
in the target dgroup.

3. Evaluation Score: A numerical estimate of match quality. The characteristics used
to determine the score depend on the rule set and may include the gmap's structural
properties and its contextual relevance.

Match rules specify which local elements can match, additional restrictions on how they
may be combined, and how these combinations are scored. These rules are the key to SME's
flexibility. To build a new matcher one simply loads a new set of match rules. This has
some important advantages. First, a theory of analogical mapping may be represented more
declaratively, and the consequences of each theoretical commitment easily traced. Second,
it enables a single program to emulate different analogy systems for comparison purposes.
Conceptually, the SME algorithm is divided into four stages:

1. Local match construction: Finds all pairs of (⟨BaseItem⟩, ⟨TargetItem⟩) that
potentially can match. A Match Hypothesis is created for each such pair to represent
the possibility that this local match is part of a global match.

2. Gmap construction: Combines the local matches into maximal consistent collections
of correspondences.

3. Candidate inference construction: Derives the inferences suggested by each gmap.

4. Match Evaluation: Attaches evidence to each local match hypothesis and uses this
evidence to compute evaluation scores for each gmap.

Each computation will now be described, using a simple example to illustrate their 
operation. 

3.2.1 Step 1: Local match construction 

Given two dgroups, SME begins by finding potential matches between items in the base and
target (see Figure 3.2). Allowable matches are specified by match constructor rules, which
take the form:

(MHCrule (⟨Trigger⟩ ⟨BaseVariable⟩ ⟨TargetVariable⟩ ⟨Condition⟩) ⟨Body⟩)

There are two possible values for ⟨Trigger⟩. A :filter trigger indicates that the rule
is applied to each pair of items from the base and target, creating a match hypothesis
when the items satisfy the condition. For example, the following rule hypothesizes a match
between any two expressions that have the same functor:

(MHCrule (:filter ?b ?t :test (equal (expression-functor ?b)
                                     (expression-functor ?t)))
    (install-MH ?b ?t))

An :intern trigger indicates that the rule should be run on each newly created match
hypothesis. These rules create additional matches suggested by the given match hypothesis.
For example, hypothesizing matches between every pair of entities would lead to
combinatorial explosions. Instead, :intern rules are used to create match hypotheses between
entities in corresponding argument positions of other match hypotheses.
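The interplay of the two trigger types can be sketched in Python. This is an illustrative model, not SME's rule engine: the tuple encoding, the `FUNCTIONS` set, and `local_matches` are assumptions. The intern-style step pairs entities in corresponding argument positions and, as an assumed literal-similarity-like choice, also pairs functions in corresponding positions even when their functors differ.

```python
from typing import Iterator, List, Set, Tuple, Union

Item = Union[str, tuple]            # entity name, or (functor, arg, ...)
MatchHypothesis = Tuple[Item, Item]

# Assumption: functors treated as functions, which may pair with a
# non-identical function found in the corresponding argument position.
FUNCTIONS = {"pressure", "temperature", "diameter"}

P_BEAKER, P_VIAL = ("pressure", "beaker"), ("pressure", "vial")
T_COFFEE, T_ICE = ("temperature", "coffee"), ("temperature", "ice-cube")
WFLOW = ("flow", "beaker", "vial", "water", "pipe")
GT_PRESSURE = ("greater", P_BEAKER, P_VIAL)

WATER_FLOW: List[tuple] = [
    WFLOW, GT_PRESSURE,
    ("greater", ("diameter", "beaker"), ("diameter", "vial")),
    ("cause", GT_PRESSURE, WFLOW),
    ("flat-top", "water"), ("liquid", "water"),
]
HEAT_FLOW: List[tuple] = [
    ("flow", "coffee", "ice-cube", "heat", "bar"),
    ("greater", T_COFFEE, T_ICE),
    ("flat-top", "coffee"), ("liquid", "coffee"),
]


def subexpressions(expr: tuple) -> Iterator[tuple]:
    """Yield an expression and every expression nested inside it."""
    yield expr
    for arg in expr[1:]:
        if isinstance(arg, tuple):
            yield from subexpressions(arg)


def is_function(item: Item) -> bool:
    return isinstance(item, tuple) and item[0] in FUNCTIONS


def local_matches(base: List[tuple], target: List[tuple]) -> Set[MatchHypothesis]:
    base_items = {s for e in base for s in subexpressions(e)}
    target_items = {s for e in target for s in subexpressions(e)}
    # Filter-style rule: pair expressions whose functors are identical.
    hypotheses: Set[MatchHypothesis] = {
        (b, t) for b in base_items for t in target_items
        if b[0] == t[0] and len(b) == len(t)
    }
    # Intern-style rule: run on every newly created match hypothesis and
    # pair up items found in corresponding argument positions.
    agenda = list(hypotheses)
    while agenda:
        b, t = agenda.pop()
        if isinstance(b, str):
            continue  # entity pairs have no arguments to descend into
        for pair in zip(b[1:], t[1:]):
            b_arg, t_arg = pair
            both_entities = isinstance(b_arg, str) and isinstance(t_arg, str)
            both_functions = is_function(b_arg) and is_function(t_arg)
            if (both_entities or both_functions) and pair not in hypotheses:
                hypotheses.add(pair)
                agenda.append(pair)
    return hypotheses


match_hypotheses = local_matches(WATER_FLOW, HEAT_FLOW)
```

Entity pairs such as beaker and coffee arise only because some expression-level hypothesis justifies them, so unrelated entity pairs such as pipe and coffee are never proposed.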
[Figure legend: squares denote match hypotheses (MH) between predicates; triangles denote
MHs between entities (Emaps).]

Figure 3.2: Local Match Construction. The water flow and heat flow descriptions of
Figure 3.1 have been drawn in the abstract and placed to the left and right, respectively. The
objects in the middle depict match hypotheses.



The result of running the match constructor rules is a collection of match hypotheses. 
MH(b_i, t_j) denotes the hypothesis that b_i and t_j match. Standard graph-theory terminology
will be used to describe the structural properties of graphs of match hypotheses (e.g., 
offspring, descendants, ancestors, root). 

For example, the result of running the match constructor rules on the water flow and
heat flow dgroups of Figure 3.1 is shown in Figure 3.2 (see also Figure 3.3). In this
example, the literal similarity rule set of structure-mapping theory is used (i.e., SME_SMT/LS).
There are several points to notice in Figure 3.3. First, there can be more than one match
hypothesis involving any particular base or target item. Second, in this rule set, predicates
are paired due to identicality, while entities are matched on the basis of their roles in the
predicate structure. Thus while TEMPERATURE can match either PRESSURE or DIAMETER,
GREATER cannot match anything but GREATER. Third, not every possible correspondence
is created. Local matches between entities are only created when justified by some other
match. This significantly constrains the number of possible matches in the typical case.



Match hypotheses shown in Figure 3.3 (B = base item, T = target item):

    MH-1    B: >Pressure          T: >Temperature
    MH-2    B: Pressure-beaker    T: Temp-coffee
    MH-4    B: beaker             T: coffee
    MH-5    B: vial               T: ice-cube
    MH-6    B: >Diameter          T: >Temperature
    MH-8    B: Diameter-2         T: Temp-ice-cube
    MH-9    B: Wflow              T: Hflow
    MH-12   B: Flat-top-4         T: Flat-top-6
    MH-13   B: Liquid-3           T: Liquid-5
    MH-14   B: water              T: coffee

Figure 3.3: Water Flow / Heat Flow Analogy After Local Match Construction. Here we
show the graph of match hypotheses depicted schematically in Figure 3.2, augmented by
links indicating expression-to-arguments relationships. Match hypotheses which are not
descended from others are called roots (e.g., the matches between the GREATER predicates,
MH-1 and MH-6, and the match for the predicate FLOW, MH-9). Match hypotheses between
entities are called Emaps (e.g., the match between beaker and coffee, MH-4).



3.2.2 Step 2: Global Match Construction 

The second step in the SME algorithm combines local match hypotheses into collections of
global matches (gmaps). Intuitively, each global match is the largest possible set of match
hypotheses that depend on the same one-to-one object correspondences.

More formally, gmaps consist of maximal, consistent collections of match hypotheses
that are structurally grounded. A collection of match hypotheses is structurally grounded
if it satisfies the grounding criterion: If a match hypothesis MH is in the collection, then
so are the match hypotheses which pair up all of the arguments of MH's base and target
items.^ The grounding criterion preserves connected predicate structure. Consistency is
determined by the rule set and in all cases to date includes the restriction that the mapping
be one-to-one. A collection is maximal if adding any additional match hypothesis would
render the collection inconsistent.

The formation of global matches is composed of two primary stages: compute
consistency relationships and merge match hypotheses.

5.2.2.1 Compute consistency relationships 

For each match hypothesis, generate the set of entity mappings it entails and the set 
of match hypotheses it is inconsistent with. This information simplifies the detection of 
contradictory sets of match hypotheses, a critical operation in the rest of the algorithm. 
The result of this stage of processing appears in Figure 3.4. 

The following two sets manage inconsistency and are crucial to this computation: 

Definition 3.1 (Conflicting) Given a match hypothesis MH(b_i, t_j), the set
Conflicting(MH(b_i, t_j)) consists of the set of match hypotheses that are pairwise
inconsistent with it.

The set Conflicting(MH(b_i, t_j)) only notes local inconsistencies (see Figure 3.4). For
example, under a one-to-one restriction, Conflicting(MH(b_i, t_j)) would include the set of
match hypotheses that represent the alternate mappings for b_i and t_j.

Conflicting and the grounding criterion combine to produce the following set:

Definition 3.2 (NoGood) The set NoGood(MH_i) is the set of all match hypotheses
which can never appear in the same gmap as MH_i. This set is recursively defined as follows:
if MH_i is an emap, then NoGood(MH_i) = Conflicting(MH_i). Otherwise, NoGood(MH_i)
is the union of MH_i's Conflicting set with the NoGood sets for all of its descendants, i.e.,

$$\mathrm{NoGood}(MH_i) = \mathrm{Conflicting}(MH_i) \cup \bigcup_{MH_j \in \mathrm{Args}(MH_i)} \mathrm{NoGood}(MH_j)$$
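
The recursive definition of NoGood can be sketched in Python as follows. This is an illustration under stated assumptions rather than SME's code: the dictionaries `args` and `conflicting`, the function name `no_good`, and the label MH-3 are hypothetical, and the small graph only loosely mirrors Figure 3.3.

```python
from functools import lru_cache

# Hypothetical MH graph: each MH maps to the MHs that pair its arguments.
# Emaps (entity matches) have no argument MHs.
args = {
    "MH-1": ("MH-2", "MH-3"),   # >Pressure / >Temperature
    "MH-2": ("MH-4",),          # Pressure-beaker / Temp-coffee
    "MH-3": ("MH-5",),          # Pressure-vial / Temp-ice-cube (label assumed)
    "MH-4": (),                 # emap: beaker / coffee
    "MH-5": (),                 # emap: vial / ice-cube
    "MH-14": (),                # emap: water / coffee
}

# Pairwise (local) inconsistencies, here from a one-to-one restriction.
conflicting = {
    "MH-1": set(), "MH-2": set(), "MH-3": set(), "MH-5": set(),
    "MH-4": {"MH-14"},          # coffee cannot pair with both beaker and water
    "MH-14": {"MH-4"},
}

@lru_cache(maxsize=None)
def no_good(mh):
    """NoGood(mh) = Conflicting(mh) united with NoGood of every argument MH."""
    result = set(conflicting[mh])
    for child in args[mh]:
        result |= no_good(child)
    return frozenset(result)

print(sorted(no_good("MH-1")))
```

Because the sets are inherited upward, the root MH-1 learns that it can never share a gmap with MH-14 even though the two are not directly in conflict.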

3.2.2.2 Merge match hypotheses 

Compute gmaps by successively combining match hypotheses as follows: 

(a) Form initial combinations: Combine interconnected and consistent match hypotheses
into an initial set of gmaps (Figure 3.5a).

(b) Combine dependent gmaps: Since base and target dgroups are rarely isomorphic,
some gmaps in the initial set will overlap in ways that allow them to be merged.
The advantage in merging them is that the new combination may provide structural
support for candidate inferences (Figure 3.5b).

(c) Combine independent collections: The results of the previous step are next combined
to form maximal consistent collections (Figure 3.5c).

*This represents a deviation from previous accounts, such as (Falkenhainer et al., 1987).
Previously, SME itself enforced a one-to-one restriction in addition to the grounding
criterion. One-to-one mappings are now an option, implementable in the rules, so that
variations on one-to-one may be tested. Additionally, an important exception to the
grounding criterion is now allowed, as discussed in Section 3.3.

Commutative predicates are supported during step (a). When multiple, complete
matches exist for the arguments of two commutative predicates, a copy of the match
between them is made and assigned to each complete, consistent combination of argument
matches. For example, if AND(b1, b2) were matched to AND(t1, t2), step (a) would replace
the single match hypothesis between the two ANDs with two alternate match hypotheses,
and place them in different gmaps, corresponding to the two alternate ways to pair their
arguments.
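
The alternate argument pairings for a commutative predicate can be enumerated with a short Python sketch. It assumes, for simplicity, that a match hypothesis exists for every base/target argument pair; the function name `commutative_pairings` is hypothetical and not part of SME.

```python
from itertools import permutations

def commutative_pairings(base_args, target_args):
    """Return every complete one-to-one pairing of two argument lists.

    Each pairing corresponds to one copy of the match between the two
    commutative expressions.
    """
    if len(base_args) != len(target_args):
        return []
    return [list(zip(base_args, perm)) for perm in permutations(target_args)]

for pairing in commutative_pairings(["b1", "b2"], ["t1", "t2"]):
    print(pairing)
```

For AND(b1, b2) and AND(t1, t2) this yields the two pairings described above: b1 with t1 and b2 with t2, or b1 with t2 and b2 with t1.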

3.2.3 Step 3: Compute Candidate Inferences 

Associated with each gmap is a (possibly empty) set of candidate inferences. Candidate 
inferences are base expressions that would fill in structure which is not in the gmap (and 
hence not already in the target). Not just any information can be carried over - it must 
be consistent with the substitutions imposed by the gmap, and it must be structurally 
grounded in the gmap. By structural grounding we mean that its subexpressions must at 
some point intersect the base information belonging to the gmap. 

The candidate inferences often include entities. Whenever possible, SME replaces all
occurrences of base entities with their corresponding target entities. If a candidate inference
contains a base entity that has no corresponding target entity (i.e., the base entity is not
part of any match hypothesis for that gmap), SME introduces a new, hypothetical entity
into the target. Such entities are represented as a skolem function of the original base
entity (i.e., (:skolem base-entity)).
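
The substitution step can be sketched in Python as below. This is an illustration only: expressions are modeled as nested tuples, the function name `carry_over` and both mapping dictionaries are assumptions, and the pipe/bar correspondence is deliberately left out so that a skolem entity appears.

```python
def carry_over(expr, entity_map, predicate_map=None):
    """Rewrite a base expression into target vocabulary.

    Expressions are nested tuples (functor, arg, ...); entities are strings.
    A base entity with no correspondence becomes (":skolem", entity).
    """
    predicate_map = predicate_map or {}
    if isinstance(expr, tuple):
        functor, *arguments = expr
        return (predicate_map.get(functor, functor),
                *(carry_over(a, entity_map, predicate_map) for a in arguments))
    return entity_map.get(expr, (":skolem", expr))

base_cause = ("CAUSE",
              ("GREATER", ("PRESSURE", "beaker"), ("PRESSURE", "vial")),
              ("FLOW", "beaker", "vial", "water", "pipe"))

# Hypothetical gmap correspondences; "pipe" is intentionally unmapped.
entity_map = {"beaker": "coffee", "vial": "ice-cube", "water": "heat"}
predicate_map = {"PRESSURE": "TEMPERATURE"}

inference = carry_over(base_cause, entity_map, predicate_map)
print(inference)
```

The unmapped base entity pipe comes out as `(":skolem", "pipe")`, standing for a hypothesized target object that plays the role of the pipe.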

In Figure 3.6, gmap #1 has the top level CAUSE predicate as its sole candidate inference.
In other words, this Gmap suggests that the cause of the flow in the heat dgroup is the
difference in temperatures. If the FLOW predicate were not present in the target, then the
candidate inferences for a gmap corresponding to the pressure inequality would be both
CAUSE and FLOW. Note that GREATER-THAN[DIAMETER(coffee), DIAMETER(ice cube)] is
not a valid candidate inference for the first Gmap because it does not intersect the existing
Gmap structure.

[Figure 3.4: graph not reproduced. Its legend distinguishes MHs between predicates from
MHs between entities (Emaps).]

Figure 3.4: Water Flow - Heat Flow analogy after computation of Conflicting relationships.
Simple lines show the tree-like graph that the grounding criterion imposes upon match
hypotheses. Lines with circular endpoints indicate the Conflicting relationships between
matches. Some of the original lines from MH construction have been left in to show the
source of a few Conflicting relations.

[Figure 3.5: three panels, (a), (b), and (c), not reproduced. The legend distinguishes MHs
between predicates from MHs between entities (Emaps).]

Figure 3.5: Gmap Construction. (a) Merge step 1: Interconnected and consistent. (b)
Merge step 2: Consistent members of the same base structure. (c) Merge step 3: Any
further consistent combinations.

    SME Version 2E
    Analogical Match from SWATER-FLOW to SHEAT-FLOW.

    Rule File: literal-similarity.rules

    # MH's | # Gmaps | 1st/2nd/Worst      | RelGroups
    14     | 3       | 5.99 / 3.94 / 2.44 | OFF

    Total Run Time: 0 Minutes, 0.657 Seconds
    BMS Run Time:   0 Minutes, 0.441 Seconds

    Best Gmaps: { 1 }

    Gmap #1: (>PRESSURE >TEMP) (PRESS-VIAL TEMP-ICE-CUBE) (PRESS-BEAKER TEMP-COFFEE)
             (WFLOW HFLOW) (BEAKER COFFEE) (VIAL ICE-CUBE) (PIPE BAR) (WATER HEAT)
      Weight: 5.9917
      Candidate Inferences: (CAUSE >TEMP HFLOW)

    Gmap #2: (DIAM-BEAKER TEMP-COFFEE) (DIAM-VIAL TEMP-ICE-CUBE) (>DIAMETER >TEMP)
             (BEAKER COFFEE) (VIAL ICE-CUBE)
      Weight: 3.9377
      Candidate Inferences: { }

    Gmap #3: (LIQUID-WATER LIQUID-COFFEE) (FLAT-WATER FLAT-COFFEE) (WATER COFFEE)
      Weight: 2.4446
      Candidate Inferences: { }

Figure 3.6: Complete SME_SMT/LS interpretation of Water Flow - Heat Flow Analogy.

3.2.4 Step 4: Compute Evaluation Scores 

Typically a particular base and target pair will give rise to several gmaps, each representing
a different interpretation of the match. Often it is desired to select only a single gmap, for
example to represent the best interpretation of an analogy. Evaluation criteria may include
structural properties, such as systematicity, as well as contextual relevance, validity, and so
forth. An evaluation score for each match hypothesis and gmap is found by running match
evidence rules and combining their results.* These scores are used to rank-order the gmaps
in selecting the "best" analogy. For example, the rule

    (rule ((:intern (MH ?b ?t) :test (and (expression? ?b) (expression? ?t)
                                          (eq (functor ?b) (functor ?t)))))
          (assert! (implies same-functor (MH ?b ?t) (0.5 . 0.0))))

states "If the two items are facts and their functors are the same, then supply 0.5 evidence
in favor of the match hypothesis." The rules may also examine match hypotheses associated
with the arguments of these items to provide support based on systematicity. This increases
match hypothesis evidence with the amount of higher-order structure supporting it.
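
How several pieces of positive evidence for one match hypothesis might be combined can be sketched as follows. The sketch uses the standard Dempster combination for simple support functions that all favor the same proposition, belief = 1 - (1 - w1)(1 - w2)...; the evidence mechanism actually used by SME is a modified Dempster-Shafer formalism, so this is an assumption-laden illustration and not its exact computation. The function name `combine_support` and the second weight 0.4 are hypothetical.

```python
def combine_support(weights):
    """Combine independent positive evidence for a single proposition.

    Each weight is the support contributed by one evidence rule; the result
    is 1 minus the product of the remaining doubts (1 - w).
    """
    doubt = 1.0
    for w in weights:
        doubt *= (1.0 - w)
    return 1.0 - doubt

# 0.5 from the same-functor rule plus a hypothetical 0.4 from another rule.
belief = combine_support([0.5, 0.4])
print(round(belief, 3))
```

Under this scheme each additional rule raises the belief but never past 1, which is one simple way to keep scores within the 0..1 range.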

Returning to Figure 3.6, note that the "strongest" interpretation (i.e., the one which 
has the highest evaluation score) is the one we would intuitively expect. In other words, 
beaker maps to coffee, vial maps to ice-cube, water maps to heat, pipe maps to bar, 
and PRESSURE maps to TEMPERATURE. Furthermore, it sanctions the candidate inference 
that the temperature difference is what causes the flow. 

3.3 Modeling Contextual Structure Mapping 

Contextual structure mapping is modeled within the rule set given to SME, which defines
SME_CSM. Since the entire rule set consists of only 22 rules, I will describe the complete
set here, using predicate calculus notation. The operators "∧" and "∨" have their
standard meanings, "¬" is a procedural test for not present, whose status will not
change during the course of processing, and "⇒" indicates a production which asserts the
consequent. Variables will be preceded by "?" and are assumed universally quantified. The
complete set of equivalent Lisp production rules used by SME_CSM is provided in Appendix B.
The description follows the program decomposition used in the previous section.

3.3.1 Step 1: Local match construction 

SME begins by running match constructor rules, which install match hypotheses between 
individual base and target items that may plausibly match. 

The first three rules match expressions by examining the predicates they use and the 
inferential support they provide. 

*The management of numerical evidence is performed by a Belief Maintenance System (BMS)
(Falkenhainer, 1988b). The BMS is much like a standard TMS, using Horn clauses as justifications.
However, the justifications are annotated with evidential weights, so that "degrees of belief" may be
propagated. A modified version of the Dempster-Shafer formalism is used for expressing and combining
evidence.
Rule 1 (Same Functors) Two expressions may match if they use the same predicate and
their predicates are not part of an a priori correspondence set.

    Equal[functor(?b), functor(?t)] ∧
    ¬Sanctioned-Pairing(functor(?b), ?anyt) ∧ ¬Sanctioned-Pairing(?anyb, functor(?t))
    ⇒ MH(?b, ?t)

The previous rule suffers from dependence on identicality. It fails to take into account
the context in which an expression is being used. Two very different predicates may support
the same conclusion in a given context. This is addressed by the following rule.

Rule 2 (Functionally Analogous) Two expressions are considered functionally analogous
and may match if they provide the same inferential support in the context of the
structures being matched.

    Implicational(?b) ∧ Implicational(?t) ∧
    Equal(functor[consequent(?b)], functor[consequent(?t)]) ∧
    ¬Connective[consequent(?b)] ∧ ¬Connective[consequent(?t)]
    ⇒ [MH(antecedent(?b), antecedent(?t))
       ∧ Function-of[antecedent(?b), Support-of(consequent(?b))]
       ∧ Provides-function[antecedent(?t), Support-of(consequent(?t))]]

If two expressions are implicational and their consequents match, then this rule will
match their antecedents. At the current time, the predicates IMPLIES, CAUSE, and SUPPORTS
are considered implicational.

The following rule respects established mappings.

Rule 3 (Sanctioned Pairing) Two items match if they are a priori designated as matching.

    Sanctioned-Pairing(?b, ?t) ⇒ MH(?b, ?t)

Other rules are used to create match hypotheses between entities in corresponding
argument positions of other match hypotheses. In this manner, entities are only matched
if sanctioned by their position in matching relations:

Rule 4 (Non-Commutative Corresponding Arguments, Entities) Two entities match
if they occupy the same argument position of non-commutative predicates that have already
been matched and neither entity is part of an a priori correspondence set.

    MH(?b1, ?t1) ∧ ¬Commutative[functor(?b1)] ∧ ¬Commutative[functor(?t1)] ∧
    Entity(?b2) ∧ Entity(?t2) ∧ Children-of?(?b2, ?t2, ?b1, ?t1) ∧
    ¬Sanctioned-Pairing(functor(?b), ?anyt) ∧ ¬Sanctioned-Pairing(?anyb, functor(?t))
    ⇒ MH(?b2, ?t2)

Five more rules exist to form matches between the arguments of expressions that have
already matched. The first is like Rule 4, except that the corresponding arguments must be
expressions whose predicate is a function. The next pairs corresponding arguments sharing
a common generalization:

Rule 6 (Non-Commutative Corresponding Arguments, Common Generalization)
Two expressions may match if they occupy the same argument position of non-commutative
predicates and use predicates having common ancestors in the generalization hierarchy.

    MH(?bp, ?tp) ∧ ¬Commutative[functor(?bp)] ∧ ¬Commutative[functor(?tp)] ∧
    Children-of?(?b, ?t, ?bp, ?tp) ∧ Common-Ancestor?[functor(?b), functor(?t)] ∧
    ¬Sanctioned-Pairing(functor(?b), ?anyt) ∧ ¬Sanctioned-Pairing(?anyb, functor(?t))
    ⇒ MH(?b, ?t)

The remaining three match constructor rules pair the arguments of commutative predicates
(i.e., the "corresponding arguments" condition, Children-of, is removed). These
three rules generate all allowable pairings between the arguments of two commutative
predicates.

3.3.2 Step 2: Global match construction 

Once an initial set of match hypotheses is formed, the pairwise consistency of match
hypotheses stated by Conflicting is used to combine them into maximal, consistent gmaps. By
the one-to-one criterion, the Conflicting set of MH(b_i, t_j) includes the match hypotheses
representing alternate mappings for b_i and t_j.

Rule 10 (One-To-One (expressions & entities), Base Case) Two match hypotheses
are mutually inconsistent and may not appear in the same gmap if they pair the same base
item to different target items.

    MH(?b, ?t1) ∧ MH(?b, ?t2) ∧ ¬Equal(?t1, ?t2)
    ⇒ Conflicting[MH(?b, ?t1), MH(?b, ?t2)]

A similar rule establishes that two match hypotheses pairing the same target item to 
different base items are mutually inconsistent. An additional pair of rules maintains the 
one-to-one mapping for predicates in the same manner used for expressions and entities. 
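
The effect of Rule 10 and its target-side counterpart can be sketched in Python. This is a minimal illustration, assuming match hypotheses are represented as (base, target) pairs; the function name `one_to_one_conflicts` and the example pairs are hypothetical.

```python
from itertools import combinations

def one_to_one_conflicts(match_hypotheses):
    """Map each (base, target) pair to the pairs it conflicts with."""
    conflicting = {mh: set() for mh in match_hypotheses}
    for mh1, mh2 in combinations(match_hypotheses, 2):
        (b1, t1), (b2, t2) = mh1, mh2
        same_base = b1 == b2 and t1 != t2      # Rule 10
        same_target = t1 == t2 and b1 != b2    # the corresponding target-side rule
        if same_base or same_target:
            conflicting[mh1].add(mh2)
            conflicting[mh2].add(mh1)
    return conflicting

mh_list = [("beaker", "coffee"), ("water", "coffee"),
           ("vial", "ice-cube"), ("water", "heat")]
conflicts = one_to_one_conflicts(mh_list)
print(conflicts[("water", "coffee")])
```

Here (water, coffee) conflicts with (beaker, coffee) because they share a target item, and with (water, heat) because they share a base item.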

Other elements of Conflicting(MH(b_i, t_j)) are defined by representation specific and
domain specific rules. The following rule is used to prevent the temporal rearrangement
problem described in Section 2.2.

Rule 14 (Temporal Preservation) Two match hypotheses are mutually inconsistent if
they pair base items always co-occurring in time, EqualTime(?b1, ?b2), to target items that
never overlap in time, DisjointTime(?t1, ?t2). Likewise, two match hypotheses are mutually
inconsistent if they pair target items always co-occurring in time to base items that never
overlap in time.

    MH(?b1, ?t1) ∧ MH(?b2, ?t2) ∧
    Temporally-Scoped(?b1) ∧ Temporally-Scoped(?b2) ∧
    Temporally-Scoped(?t1) ∧ Temporally-Scoped(?t2) ∧
    { [EqualTime(?b1, ?b2) ∧ DisjointTime(?t1, ?t2)] ∨
      [DisjointTime(?b1, ?b2) ∧ EqualTime(?t1, ?t2)] }
    ⇒ Conflicting[MH(?b1, ?t1), MH(?b2, ?t2)]
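
The temporal test in Rule 14 can be sketched as below. The sketch makes a simplifying assumption that is not in the original: each temporally scoped item is reduced to a half-open numeric interval (start, end), so that EqualTime means identical intervals and DisjointTime means no overlap. All function names are hypothetical.

```python
def equal_time(x, y):
    """Items always co-occur: modeled here as identical intervals."""
    return x == y

def disjoint_time(x, y):
    """Items never overlap: half-open intervals [start, end) do not intersect."""
    return x[1] <= y[0] or y[1] <= x[0]

def temporally_conflicting(b1, t1, b2, t2):
    """True if MH(b1, t1) and MH(b2, t2) violate temporal preservation."""
    return ((equal_time(b1, b2) and disjoint_time(t1, t2)) or
            (disjoint_time(b1, b2) and equal_time(t1, t2)))

# Two base facts hold over the same interval, but their target partners do not overlap.
print(temporally_conflicting((0, 5), (0, 3), (0, 5), (3, 6)))
```

The example prints `True`: pairing co-occurring base facts with target facts from different periods would rearrange the temporal structure, so the two match hypotheses are marked Conflicting.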

The rearrangement problem exists for any undecomposable collection. The following 
rule preserves the compound object "contained liquid". 

Rule 15 (Compound Object Preservation (contained liquids)) Two match hypotheses
are mutually inconsistent if they pair items representing a contained liquid and its
container with another contained liquid and something other than its container, respectively.

    MH(?b1, ?t1) ∧ MH(?b2, ?t2) ∧
    Contained-Liquid(?b1) ∧ Contained-Liquid(?t1) ∧
    { [Container-of(?b1, ?b2) ∧ ¬Container-of(?t1, ?t2)] ∨
      [¬Container-of(?b1, ?b2) ∧ Container-of(?t1, ?t2)] }
    ⇒ Conflicting[MH(?b1, ?t1), MH(?b2, ?t2)]

One element of contextual structure mapping required a change to the SME program itself:
relaxing the structural grounding criterion to exclude relational groups. In the standard
procedure for copying match hypotheses between commutative predicates (described in
Section 3.2.2.2), complete match sets for their arguments were required. In the case of a
match between predicates forming relational groups, the requirement is weakened. Only a
single match between their arguments need exist, and all maximal, consistent collections
of argument pairings constitute a valid copy of the commutative pairing.*

3.3.3 Step 4: Compute evaluation scores 

Once the gmaps have been formed (step 2) and their corresponding candidate inferences
computed (step 3), each match hypothesis and gmap is assigned a match evaluation score.†
As described in Chapter 2, both structural and relevance criteria are used to compute these
evaluation scores.

*If this sounds like the SME match problem in miniature, in many ways it is. The same gmap merge step
procedure is used for this operation. Rather than "relational groups", one could view these as a dgroup
within a dgroup.

†It should be pointed out that numerical evidence is used to provide a simple way to combine local
information concerning match quality. These weights have nothing to do with any probabilistic or evidential
information about the base or target per se. Additionally, the evidence scores used here are lower than
those previously described for SME_SMT. It was found that the prior scores pushed the weights too far into
the high end of the 0..1 spectrum, offering little difference between fair matches and very good matches.

The evidence rules have a slightly different syntax from the rules described above.
Rather than implying a particular match hypothesis form, they supply evidence through the
form Implies(<antecedent>, <consequent>, <weight>) (see Falkenhainer, 1988b; Falkenhainer
et al., 1987 for an explanation of the evidence mechanism). In each rule described below,
<weight> will be given as #<parameter-name>. The numeric values of these parameters
are then summarized at the end of the section.

The first evidence rule supports the first two match constructor rules, which examined
the predicates in use. It supplies evidence in inverse proportion to the distance within the
generalization hierarchy between the predicates matched. This distance is the number of
nodes in the minimal path between the matched predicates in the hierarchy, reducing to
one in the case of predicate identicality.

Rule 16 (Minimal Ascension) If the expressions comprising a match hypothesis were
paired due to common ancestors in the generalization hierarchy, then supply an evidence
score inversely proportional to their distance (number of nodes) in the hierarchy, equal to
#MA/distance.

    MH(?b, ?t) ∧ Expression(?b) ∧ Expression(?t) ∧
    ¬Sanctioned-Pairing(functor(?b), ?anyt) ∧ ¬Sanctioned-Pairing(?anyb, functor(?t))
    ⇒ Implies[type-match, MH(?b, ?t),
              quotient(#MA, path-length(functor(?b), functor(?t)))]

Rule 17 (Sanctioned Pairing Evidence) If two items are a priori designated as matching,
then supply an evidence score of #SP to the match.

    Sanctioned-Pairing(?b, ?t) ⇒ Implies[sanctioned-pairing, MH(?b, ?t), #SP]

Rule 18 (Functionally Analogous Evidence) If the base expression of a match hypothesis
provides inferential support f in the base situation, and the target expression can
provide that inferential support, then supply an evidence score of #FA to the match.

    MH(?b, ?t) ∧ Expression(?b) ∧ Expression(?t) ∧
    Function-of(?b, ?f) ∧ Provides-function(?t, ?f)
    ⇒ Implies[And[Function-of(?b, ?f), Provides-function(?t, ?f)],
              MH(?b, ?t), #FA]
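
The distance used by Rule 16 (Minimal Ascension) can be sketched in Python. The sketch assumes the generalization hierarchy is a tree stored as a child-to-parent dictionary; the hierarchy contents, the function names, and the value 0.9 passed for #MA are placeholders chosen for illustration, not values from the rule set.

```python
def ancestors(pred, parent):
    """Return [pred, parent(pred), ...] up to the root of the hierarchy."""
    chain = [pred]
    while chain[-1] in parent:
        chain.append(parent[chain[-1]])
    return chain

def path_length(p, q, parent):
    """Number of nodes on the minimal path from p to q through their lowest
    common ancestor (1 when p == q); None if they share no ancestor."""
    up_p = ancestors(p, parent)
    up_q = ancestors(q, parent)
    for steps_p, node in enumerate(up_p):
        if node in up_q:
            return steps_p + up_q.index(node) + 1
    return None

def minimal_ascension_evidence(p, q, parent, ma):
    """Evidence score #MA / distance, or None if the predicates are unrelated."""
    distance = path_length(p, q, parent)
    return None if distance is None else ma / distance

# Hypothetical fragment of a generalization hierarchy (child -> parent).
hierarchy = {"PRESSURE": "QUANTITY", "TEMPERATURE": "QUANTITY", "QUANTITY": "THING"}
print(path_length("PRESSURE", "TEMPERATURE", hierarchy))
```

With this hierarchy, PRESSURE and TEMPERATURE are three nodes apart (PRESSURE, QUANTITY, TEMPERATURE), so the match would receive one third of #MA, while identical predicates receive the full #MA.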
Systematicity is supported by passing evidence from a match involving a relationship to
the matches involving its arguments. The following rule accomplishes this by propagating
#SYS% of a match hypothesis' belief to its offspring.

Rule 19 (Systematicity, Non-Commutative Case)

    MH(?b1, ?t1) ∧ MH(?b2, ?t2) ∧ Children-of(?b2, ?t2, ?b1, ?t1)
    ⇒ Implies[MH(?b1, ?t1), MH(?b2, ?t2), #SYS]

A second rule is used to propagate systematicity through commutative predicates by
removing the corresponding arguments test (i.e., Children-of). The more matched structure
that exists above a given match hypothesis, the more that hypothesis will be believed. Thus
this "trickle down" effect provides a local encoding of Gentner's systematicity principle.

Finally, contextual relevance is used to provide additional evidence for those matches 
supporting the current reasoning needs of the global reasoning system. There are two 
factors to consider. First, which relations are more salient for the current reasoning task? 
For example, in PHINEAS the central focus is to explain an observed behavior. Matches 
identifying corresponding behavior are given greater import than matches for other features. 

Rule 21 (Behavioral) If a match hypothesis is between two behavioral relations (e.g.,
Increasing, Decreasing), then supply an evidence score of #Behavior to the match.

    MH(?b, ?t) ∧ Expression(?b) ∧ Expression(?t) ∧
    Behavioral-Relation(?b) ∧ Behavioral-Relation(?t)
    ⇒ Implies[behavioral, MH(?b, ?t), #Behavior]

Second, which gmaps offer candidate inferences providing needed knowledge? For example,
if a cause for E is sought, gmaps offering the inference Cause(C, E) would be preferred.
In PHINEAS, the relation B-Explains(T, B) is used to state that the set of theories T
explains the behavior B. This predicate is then sought as part of the candidate inferences
for a gmap.

Rule 22 (Provides Relevant Inference) If a gmap contains a candidate inference
supporting a behavioral explanation, then supply an evidence score of #RInf to the gmap.

    CI(?gmap, B-Explains(?base-theory, ?target-behavior)) ∧
    Current-Observation(?target-behavior)
    ⇒ Implies[CI(?gmap, B-Explains(?base-theory, ?target-behavior)),
              ?gmap, #RInf]

The specific parameter settings used in SME_CSM are:

    Evidence Parameter    Value
    #MA                   (illegible in source)
    #SP                   0.4
    #FA                   0.8
    #SYS                  0.8
    #Behavior             0.4
    #RInf                 0.9



Whenever numeric weights are used to influence a system's function, there is danger of
(1) tailoring for particular examples and (2) sensitivity to specific values. Values for the
evidence parameters in SME_CSM were selected based on long experience with SME_SMT and
general intuition. Of course, a more formal sensitivity analysis is required. However, SME_CSM's
evidence parameters have not changed throughout the development of PHINEAS and the
same rule set was used for all examples. Ken Forbus and I have conducted
preliminary empirical studies to determine SME_SMT's sensitivity to the space of possible
parameter settings. On simple examples such as the small water flow / heat flow analogy
described in Section 3.2, it was found that simply having non-zero settings is sufficient.
On more complex analogies, such as the short stories discussed in (Skorstad et al., 1987),
performance is robust but not completely insensitive to parameter settings. Most crucial
is the setting for systematicity. This must be high for SME_SMT to demonstrate a marked
preference for higher-order systems of relations. The findings are too preliminary to draw
conclusions. Analysis is tedious due to the size of the space being considered: ranging
SME_SMT's eight parameters through four values each yields 65,536 data points for a single
base-target pair. Although these findings should apply equally well to SME_CSM, studies on
SME_CSM have not yet begun.
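
The size of the parameter space quoted above follows from 4^8 = 65,536. A small Python sketch makes the enumeration concrete; the four candidate values are placeholders, not the values used in the sensitivity studies.

```python
from itertools import product

# Hypothetical grid: the four candidate values are placeholders.
candidate_values = (0.2, 0.4, 0.6, 0.8)
num_parameters = 8  # the eight evidence parameters mentioned in the text

# Every assignment of one candidate value to each of the eight parameters.
settings = list(product(candidate_values, repeat=num_parameters))
print(len(settings))
```

Each element of `settings` is one complete parameter assignment, so a full sweep would require one SME run per element for every base-target pair studied.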



3.4 Analysis 

This section presents a critical review of SME from both analytical and empirical
perspectives. First, a summary of SME's complexity is given. This is followed by an empirical
discussion of alternate domain representations and their impact on performance. Finally,
SME's generality is described in the context of its existing applications.

3.4.1 Complexity analysis

(Falkenhainer et al., 1987) presents a detailed analysis of each phase in the SME algorithm.
This section summarizes those results. In the discussion, N_b and N_t are the number of base
and target dgroup items, respectively. Their average is denoted by N.

1. Local match construction: In both SME_SMT and SME_CSM, match constructor rules are
simple and we may assume rule execution takes unit time. Under this assumption,
both :filter and :intern rules require O(N_b * N_t), or roughly O(N^2). However, in
practice the :intern rules have a run time of approximately O(N).

2. Calculating Conflicting: SME_CSM assigns a Conflicting set to each match hypothesis,
MH(b_i, t_j), which represents the alternate mappings for b_i and t_j. Worst case is
O(N^2), while the best case performance is O(max(N_b, N_t)).

3. Emaps and NoGood calculation: Each match hypothesis is operated on once, which
in the worst case is O(N^2).

4. Gmap construction: Global matches are formed in a sequence of three merge steps:

(a) Assuming that most of the match hypotheses will appear in only one or two
subgraphs (some roots may share substructure), the first merge step is proportional
to the number of match hypotheses, or worst case O(N^2).

(b) In the worst case, this step is equivalent to Step 4(c), which can display O(N!)
performance. If the base and target dgroups give rise to a match hypothesis
graph having a single, consistent root, then there is only one gmap and the
second (and third) merge steps are constant-time. Typically, the second merge
step is very quick and displays near best-case performance.

(c) The complexity of this final merge step is directly related to the degree of
structure in the base and target domains and how many different predicates are in
use. This issue is reexamined in the next section. Worst-case performance
occurs when the description language is flat (i.e., no higher-order structure) and
the same predicate occurs many times in both the base and the target. In a
language with a single, unary predicate, this reduces to the problem of finding
all isomorphic mappings between two equal size sets, which is O(N!). In the
other extreme, when the base and target dgroups give rise to a match hypothesis
graph that has but one root, the third merge step is a constant-time operation.

5. Finding candidate inferences: In the worst case we have an upper bound of
O(N^[exponent illegible in source]). In the best case, there will only be one gmap and
no candidate inferences, producing constant time behavior.

6. Selection score computation:

The complexity of the evidence mechanism (BMS) is difficult to ascertain, and ranges
from 80% of SME's processing time on small examples to less than 5% on large examples.
The BMS maintains dependencies between evidential results and may be eliminated
if their explanation is not required. The original SME (Falkenhainer et al., 1986)
used a specialized O(N^[exponent illegible in source]) system.
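
The O(N!) worst case of merge step 4(c) can be made concrete with a short Python sketch. With a single unary predicate, every one-to-one assignment of base items to target items is a consistent mapping, so the number of candidates is N!. The function name and item labels are hypothetical.

```python
from itertools import permutations
from math import factorial

def all_one_to_one_mappings(base_items, target_items):
    """Enumerate every bijection between two equal-size collections."""
    if len(base_items) != len(target_items):
        raise ValueError("collections must have equal size")
    return [dict(zip(base_items, perm)) for perm in permutations(target_items)]

mappings = all_one_to_one_mappings(["b1", "b2", "b3", "b4"],
                                   ["t1", "t2", "t3", "t4"])
print(len(mappings), factorial(4))
```

Four items on each side already give 24 mappings; ten items would give 3,628,800, which is why flat descriptions with a repeated predicate are the pathological case.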

3.4.2 Implications for Representation 

The proper representation becomes an issue in SME due to its significant impact on speed.
Highly structural, nested representations provide an important source of
constraint on generating potential matches. They tend to make the semantic interrelations
explicit in the structure of the syntax. For example, a theory might be represented as a
sequence of axiom statements

    Axiom-of(T1, axiom_i)

or as

    Theory(T1, SET(axiom_1, ..., axiom_i, ..., axiom_n))

While SME is able to process domain descriptions in any predicate-based format, the
latter is significantly more efficient. The reason is that the set representation for theories
places the related axioms syntactically together, reducing the number of spurious local
matches.

Several different representations for temporal intervals were empirically tested with SME.
In each, the base described the four-state cyclic behavior of a spring-block configuration,
while the target described the four-state cyclic behavior of an oscillating LC circuit. Static
situation information was also included in all representations, such as (BLOCK block) and
(CONNECTED spring block). The first representation tested was a standard situation
calculus syntax:

    (DURING (CONSTANT (POSITION spring)) S1)
    (DURING (CONSTANT (POSITION block)) S1)
    (DURING (DECREASING (POSITION spring)) S2)
    (DURING (DECREASING (POSITION block)) S2)
    (DURING (CONSTANT (POSITION spring)) S3)
    (DURING (CONSTANT (POSITION block)) S3)
    (DURING (INCREASING (POSITION spring)) S4)
    (DURING (INCREASING (POSITION block)) S4)
    (MEETS S1 S2)
    (MEETS S2 S3)
    (MEETS S3 S4)
    (MEETS S4 S1)

This produced 306 possible gmaps and took SME a total of 2 minutes.

The second representation was taken from Hayes' (1979) definition of history as a
description of a single object's behavior over time. The predicate AT is used to specify a slice,
the intersection of an object with a period of time (either interval or instant).* In this
representation, the oscillatory behavior is described as:

    (CONSTANT (POSITION (AT spring S1)))
    (CONSTANT (POSITION (AT block S1)))
    (DECREASING (POSITION (AT spring S2)))
    (DECREASING (POSITION (AT block S2)))
    (CONSTANT (POSITION (AT spring S3)))
    (CONSTANT (POSITION (AT block S3)))
    (INCREASING (POSITION (AT spring S4)))
    (INCREASING (POSITION (AT block S4)))
    (MEETS S1 S2)
    (MEETS S2 S3)
    (MEETS S3 S4)
    (MEETS S4 S1)

This syntax produced 756 possible gmaps and took SME a total of 29 minutes, 52 seconds.
It should also be noted that this is the only description of time that is a priori immune
to the structure rearrangement problem described in Section 2.2. This is because the time
token is present at the bottom-most level of description; there is no smaller expression that
contains the object token spring and does not contain the state token S1.

Finally, the nested temporal syntax currently used in PHINEAS was tested. The syntax
for a temporal interval, called a situation, is:

    SITUATION(<NameToken>, <Relations>)

Using this syntax, the oscillator was described as:

    (MEETS (SITUATION S1 (SET (CONSTANT (POSITION spring))
                              (CONSTANT (POSITION block))))
           (SITUATION S2 (SET (DECREASING (POSITION spring))
                              (DECREASING (POSITION block)))))
    (MEETS (SITUATION S2 (SET (DECREASING (POSITION spring))
                              (DECREASING (POSITION block))))
           (SITUATION S3 (SET (CONSTANT (POSITION spring))
                              (CONSTANT (POSITION block)))))
    (MEETS (SITUATION S3 (SET (CONSTANT (POSITION spring))
                              (CONSTANT (POSITION block))))
           (SITUATION S4 (SET (INCREASING (POSITION spring))
                              (INCREASING (POSITION block)))))
    (MEETS (SITUATION S4 (SET (INCREASING (POSITION spring))
                              (INCREASING (POSITION block))))
           (SITUATION S1 (SET (CONSTANT (POSITION spring))
                              (CONSTANT (POSITION block)))))

*This was the representation used in an earlier version of PHINEAS (Falkenhainer, 1986).

Table 3.1: SME performance on alternate temporal representations.

    Representation        # MH's   # Gmaps   Total BMS run time   Total match run time
    Situation Calculus    164      306       0:27.72              1:33.18
    Slices                212      756       0:49.34              29:02.32
    Nested                125      46        0:1?.?7              0:06.63

NOTE: All times are given in minutes:seconds.fraction. Total match time is total SME run time
minus BMS run time. Two digits of the Nested BMS time are illegible in the source; the 26-second
total reported in the text implies roughly 19 seconds.

This syntax resulted in 46 gmaps and took a total of 26 seconds to compute. Notice that
the time for computing the match alone (sans BMS) dropped from over 29 minutes for the
slices notation to under 7 seconds. The apparent redundancy of the description (each
situation appears twice) is virtual, not real. To SME, all syntactically identical subexpressions
in a description map to the same internal expression. Furthermore, expressions may be
named for use as arguments to other expressions. For example, the above description was
actually given to SME as four named situation expressions. Their temporal ordering was
then given in the same way it was for the other two representations, for example, (MEETS
situation1 situation2). situation1 is now a pointer to a situation description rather
than a situation token.
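
The sharing of syntactically identical subexpressions can be illustrated with a small interning table in Python. This is a sketch of the general technique, not SME's internal representation: the class `ExpressionTable`, its methods, and the helper `situation` are hypothetical, and expressions are modeled as nested tuples.

```python
class ExpressionTable:
    """Maps syntactically identical expressions to one shared object."""

    def __init__(self):
        self._table = {}

    def intern(self, expr):
        # Intern the parts first so that shared substructure is reused.
        if isinstance(expr, tuple):
            expr = tuple(self.intern(part) for part in expr)
        return self._table.setdefault(expr, expr)

    def count(self, functor):
        """Number of distinct stored expressions with the given functor."""
        return sum(1 for e in self._table
                   if isinstance(e, tuple) and e[0] == functor)

def situation(name, relation):
    return ("SITUATION", name,
            ("SET", (relation, ("POSITION", "spring")),
                    (relation, ("POSITION", "block"))))

s1, s2 = situation("S1", "CONSTANT"), situation("S2", "DECREASING")
s3, s4 = situation("S3", "CONSTANT"), situation("S4", "INCREASING")
description = [("MEETS", s1, s2), ("MEETS", s2, s3),
               ("MEETS", s3, s4), ("MEETS", s4, s1)]

table = ExpressionTable()
interned = [table.intern(expr) for expr in description]
print(table.count("SITUATION"))
```

Although each situation is written twice in the description, only four SITUATION expressions are stored, and the second argument of the first MEETS is the very same object as the first argument of the second MEETS.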

These results are summarized in Table 3.1. The difference in speed is primarily due to 
the operation of merge step 2, which combines matches sharing a common base structure. 
The set notation for time enables merge step 2 to know that matches for state S1 behavior
of the spring-block oscillator should be placed in the same gmap, thus reducing the number 
of possibilities in merge step 3. 

While perhaps somewhat unorthodox, this representation has some desirable properties.
First, most expressions are simpler since temporal references are implicit. At least in terms
of analogical processing, indexing a situation's facts this way drastically reduces the number
of match hypothesis combinations possible. Second, it makes the temporal clustering of
relations explicit in the syntax.

Similar conventions have been used in PHINEAS for a number of representation problems,
such as the representation of theories, with comparable savings.

The key implication for analogical processing is that syntax should mirror semantics. If
there is a strong first or second order relationship between two expressions, this relationship
should be obvious from the syntax. Such relationships are typically not mirrored in the
syntax of standard first order predicate calculus. For example,

(Greater-than x y S1)
(Break y S2)
(Cause S1 S2)

does not syntactically reflect the important relationship that exists between the Greater-than
expression and the Break expression. On the other hand,

Cause[Greater-Than(x,y), Break(y)]

makes their relationship structurally explicit. Similar arguments have been made in favor of
semantic net representations (Winston, 1984), despite their logical equivalence with FOPC
(Hayes, 1977).
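
The difference between the two notations can be made concrete with a small sketch that rewrites the token-indexed facts into the nested form. It is illustrative only: the function `nest` is hypothetical, and it assumes that the situation token is always the last argument of a fact and that each token labels exactly one fact.

```python
# Hypothetical sketch: turn token-indexed facts into a nested expression in
# which the linking relation takes the related expressions as its arguments.
def nest(facts, relation="Cause"):
    by_token = {}   # situation token -> the fact it labels, with the token dropped
    links = []      # argument tokens of each `relation` fact
    for pred, *args in facts:
        if pred == relation:
            links.append(tuple(args))
        else:
            *body, token = args  # assumption: the token is the last argument
            by_token[token] = (pred, *body)
    return [(relation, *(by_token[t] for t in tokens)) for tokens in links]


flat = [("Greater-than", "x", "y", "S1"),
        ("Break", "y", "S2"),
        ("Cause", "S1", "S2")]
nested = nest(flat)
```

In the flat form, the link between Greater-than and Break is visible only by chasing the tokens S1 and S2. In the nested form, both expressions appear directly as arguments of Cause, so a structural matcher sees the relationship without any lookup.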

3.4.3 Generality 

There are a number of factors in evaluating the success and generality of a program. While
it is important to be able to demonstrate more than one example, and SME has successfully
run on over 40, conclusions from sheer number of examples should be limited. For example,
in the water flow - heat flow example, if heat flow were replaced by an isomorphic
description of electrical flow, SME wouldn't know the difference. Should we say these
represent two examples, or merely one? How then has tailorability been reduced and coverage
determined? First, only a small set of rule files has been used throughout the various
studies involving SME. For example, every PHINEAS example presented in this thesis used the
same rule set described earlier. Second, PHINEAS places the user one step farther from SME
by being in charge of generating SME's input and inspecting SME's output. Furthermore,
PHINEAS requires that the representations used by SME be able to satisfy a specific
performance task. A representation developed to perform useful inferences has fewer arbitrary
choices than a representation developed specifically for analogical matching. Finally, SME
has demonstrated a high degree of generality through its multiple uses. It has been used in
cognitive simulation studies, served as a component in other systems (including PHINEAS),
and been configured to emulate other analogy programs. Specifically, SME has been used
in:

• Cognitive simulation of Structure-Mapping theory: SME has been successfully used in
studies comparing the psychological predictions of structure-mapping theory (Skorstad
et al., 1987). It has also been used to apply the concepts of Structure-Mapping
theory to metaphor understanding (Gentner et al., 1987).

• As a component in SEQL: Janice Skorstad has used SME as a component in SEQL, a
concept learning program that forms generalized structural concept descriptions from
a sequence of samples (Skorstad et al., 1988; Skorstad, 1989). SEQL has further been
used for studying sequence effects in concept formation.

• Simulating SPROUTER: Hayes-Roth and McDermott (1978) describe a technique for
partial matching of structural descriptions called interference matching. Their SPROUTER
program uses the matching to form a generalized conjunctive concept description of
a set of target examples. SME and the generalization module it contains have successfully
reproduced the first two (out of three) examples discussed in (Hayes-Roth &
McDermott, 1978).* The third example has never been tried. (Falkenhainer, 1988a)
briefly discusses how SME may be configured to emulate SPROUTER.

• Simulating ACME: Holyoak and Thagard (1988a) describe a new system, ACME, which
may be described as a descendant of SME, with SME's three merge steps replaced by
connectionist relaxation techniques. An ACME rule set has been used to reproduce a
number of examples described in their paper. This rule set is discussed further in
(Falkenhainer, 1988a) and the two programs are compared in Section 10.2.1.5.

*SME contains a module that takes a gmap produced by the matching component and returns
three alternate, generalized conjunctive concept descriptions covering the base and target instances (see
Falkenhainer, 1988a). The three alternatives correspond to (1) only what base and target have in
common identically, (2) everything base and target have in common (e.g., PRESSURE matched to TEMPERATURE
turns into FUNCTION-3), and (3) everything base and target have in common, plus the candidate inferences
proposed for the target.

Chapter 4
Access



Access is the process of reminding and recognition. In the context of physical analogies,
the first step towards explaining a newly encountered observation is attempting to relate
it to understood situations. Is it an instance of an understood phenomenon? Could it be if
a few assumptions were made? Is it similar to an understood phenomenon? For analogical
learning, the goal is to retrieve theories most likely to provide an accurate explanation.

Analogical access has proven to be a difficult problem for AI and few analogy systems
address it. As reviewed in Section 2.1, access is typically a matter of being presented with
a complete base representation, some form of specific cue, as in a teacher supplied hint
(Burstein, 1983; Greiner, 1988), or the specific goal concept (Kedar-Cabelli, 1985b). Most
work on access falls under case-based reasoning (e.g., Kolodner, 1984; Ashley & Rissland,
1987). In these systems, access typically proceeds by indexing the features of the situation
description into a memory organized as a discrimination net or decision tree. These
techniques have two problems. First, they tend to be sensitive to incomplete information
and the ordering of the discrimination tree. Second, they limit retrieval to looking for
features that match exactly. This may be appropriate for case-based reasoning, which can
be considered as a form of within-domain analogy, but is insufficient for across-domain
learning.

Psychological evidence provides two suggestions about access. First, analogical accessibility
tends to be governed by surface similarities, also known as mere-appearance matches
(Ross, 1984; Gentner & Landers, 1985). This is the kind of recall we wish to avoid in
expert problem solving. Surface similarities are not necessarily predictive. Yet how can
we a priori know what will be predictive if at the same time we're trying to learn that
prediction-generating knowledge?

Second, in human processing of physical analogies and many types of problem solving,
imagery seems to play a major role in both accessibility and evaluation (Dreistadt, 1968;
Dreistadt, 1969; Kaufmann, 1979; Kosslyn, 1980; Shepard & Cooper, 1982; Miller, 1986).
Some go so far as to say "analogy production related to problem solving is a visual process"
(Kaufmann, 1979, page 119). These studies show imagery has implications for remindings
as well as evaluation - the ability to "try out" a proposed solution before actually acting
on it, an aspect discussed more in Chapter 6.

While our goal is not to build a psychological model, the second suggestion will prove 
useful. First, visual processing appears to have the ability to abstract and store vast 
amounts of information and detect patterns that are relatively easy to manipulate and
recall. Second, time varying, dynamic behavior is highly predictive of underlying causal 
mechanisms, sometimes more so than an incomplete, static situation description. Both 
factors are important for computational studies of analogy. Given current technology, an 
autonomous account of the first will not be attempted. 

This chapter describes the model and implementation of access used in PHINEAS. It is
based on the claim that both the behavioral and structural similarity of two phenomena 
can be used to initiate and guide the mapping of an explanatory causal model. 

4.1 Accessing Physical Analogies 

In analogical learning, one starts with a partially understood model of a domain, or a
teacher-supplied hint which serves the same purpose. This incomplete model is then used
to key access and constrain the mapping that serves to complete the model. In learning
from observation there is no teacher to provide a hint. If the phenomenon is new, the learner
may not even have a partial causal model to drive access. Therefore, some other key into
memory is required to constrain the mapping process. The only available information is
the observable structural and behavioral characteristics of the situation. What must be
specified is how this information can drive access. Specifically, it must

• provide commonality between systematic pieces of knowledge. Commonality among
richly interconnected relations will maximize the probability that we have a true
analogy, rather than a chance, marginal similarity.

• highlight important aspects, in order to reduce spurious remindings and improve the
chance of finding the most relevant analogue.

• be predictive. Time is best spent on hypotheses that have a high chance of being
correct.

One component of access is correlational: when seeking goal G, select the subset of
available information that is known to correlate to G and find an analogue sharing the
maximum amount of similar information.

The other component of access is discriminability: when seeking goal G, focus on the
subset of available information leading to the smallest, yet plausible, analogue candidate
set. In other words, seek factors that will allow us to select good analogues from a vast set
of possibilities in the minimal amount of time.

Both of these components appear in some form in any decision making process. They
may be in the form of probabilities leading to a particular cause for a given episode, as in
diagnostic decision making (Pople, 1977), an analysis of how to partition existing data, as
in ID3 (Quinlan, 1983), or of a more symbolic or heuristic nature. The important point
is that if a portion of available information has a tendency to co-occur with the desired
unknown, and tends to only be associated with that unknown, it should be used as a predictive key
into memory. How can this be achieved for the physical analogy task?

The first clue is that models of physical systems are decomposable into different
perspectives of the same phenomenon, such as structural, behavioral, causal, and teleological.
The amount of knowledge available tends to be different for each. Thus, one element to
accessing physical analogies is to recognize that one should key on whatever perspectives
are most readily available. For example, suppose a complete behavioral model and a scant
causal model exist for a given system. In attempting to form a full causal model, it would
be wise for the access mechanism to key on similar behavior rather than trying to find all
systems which overlap what little is already known about the target causal model. Because
a lot is known about the behavior, it maximizes the relational information available and
the descriptive space in which to maximize commonalities. This satisfies an important
criterion: seek commonality among richly interconnected relations to increase the likelihood
of a true analogy. The problem of drawing distinct boundaries and defining the separate
categories isn't really important. The lines may often be drawn differently for different
situations and by different people. What is important is that different types of knowledge,
or aspects, about the same situation tend to have a cross correlation. It is this correlation
that I wish to exploit.

A second important feature of access is the use of abstractions. Attempting to locate
similar knowledge structures when given a highly detailed model can be needlessly complex.
Abstracting out the key features of the detailed model simplifies the task. For example, in
trying to access behavior similar to water heating on a stove, it is easier to use an abstract
model of flow than a detailed model of changing pressures and moving molecules. Typically,
dynamic behavior is most readily abstracted through visual processing and is the first model
one has for a new domain. In more expert problem solving, structural or causal models
may be abstracted, enabling the discernment of salient features for triggering the model
(e.g., "central-force system"). A single phenomenon or model may be represented using
multiple, overlapping abstractions. Thus, access should also key on shared abstractions.

This view should not be confused with theories that treat analogies as shared
abstractions (e.g., Greiner, 1988). In these approaches, only a single model type exists, forcing
the system to find a shared abstraction with the target concept. Analogy is conjectured as
equivalent to instantiating a common abstraction. This overly restrictive view of analogy
does not capture the breadth of the phenomenon. For example, in this thesis a complete
behavioral model is used to conjecture a new causal model. In this manner, the power of
using shared abstractions may be achieved without giving up the creative power of analogy.

4.1.1 What types of behaviors? 

Not just any behavioral abstraction will do, and occasionally behavior alone is insufficient.
For example, in searching for an analogue to a spring-block oscillator, we are more
discerning than simply looking for an instance of something "going back and forth". We tend to
know that the spring-mass oscillator is a passive system exhibiting a response to some
initial perturbation. We also know to prefer mechanical systems such as another spring-mass
example or a torsion oscillator (i.e., something oscillating due to elasticity). A person
walking to and from school each day should not be seriously entertained as an analogue. Thus,
an additional criterion to abstraction-based access is in having the right abstractions. The
maximum amount of detail perceptually available should be captured while still having a
concise representation.

Consider what information may be recorded about a hot brick immersed in cold water.
One particularly useless account would relate just that information: there is some water in
a bucket, and a brick is sitting in the bucket, immersed in the water. This isn't going to
help too much in explaining the behavior since there is no behavior. A second approach
might describe two connected situations, S1 transitioning to S2. In S1, the temperature of
the brick is greater than the temperature of the water and the temperature of the brick is
decreasing, while the water temperature is increasing. S2 indicates that their temperatures
are equal and constant. This is a better representation of what is happening, yet it still
doesn't provide many clues. Finally, we could augment this description with more of its
temporal or graphical characteristics. The temperatures of the brick and water are
asymptotically approaching each other. This provides a sense of "process". The temperatures
approach each other and stop changing when they are equal. The two rates of change
appear to be proportional in some manner. This suggests the concepts of exchange and
equilibrium.



4.2 Implementation 

There are two primary phases in processing a new observation. First, the raw observation
must be received, translated into qualitative terms, and analyzed for patterns (e.g.,
sinusoidal oscillation). Second, the qualitative description is used to key into memory to see if
it corresponds to a known phenomenon, or is similar to any understood phenomena. The
first component is external to PHINEAS. The second component is the primary topic of this
section.

The user translates the raw observation into qualitative values and derivatives
(increasing, constant, decreasing), and divides it into qualitatively equivalent temporal intervals.
For example, a series of values for quantity q1 might be represented as q1 = 0, q1 > q2,
and Increasing(q1) during state S1, transitioning to q1 > 0, q1 > q2, and Increasing(q1)
during state S2. Additionally, the user provides information about global patterns in the
data, such as sinusoidal oscillation or asymptotic approach to zero.
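
The translation step is performed by the user and lies outside PHINEAS, so the following Python sketch is purely illustrative of what that manual step amounts to for a single quantity. The function names, the tolerance `eps`, and the choice to label each step by the value at its start are assumptions of this example, not part of the system.

```python
# Hypothetical sketch: collapse a numeric series into qualitative states,
# each a (value sign, derivative direction) pair, merging equal neighbors.
def sign(v, eps=1e-9):
    """Return 1, 0, or -1 according to whether v is above, within, or below eps of zero."""
    return (v > eps) - (v < -eps)


def qualitative_states(samples, eps=1e-9):
    values = {1: "positive", 0: "zero", -1: "negative"}
    trends = {1: "increasing", 0: "constant", -1: "decreasing"}
    states = []
    for prev, cur in zip(samples, samples[1:]):
        # Assumption: a step is described by the value at its start and the
        # direction of change across it.
        state = (values[sign(prev, eps)], trends[sign(cur - prev, eps)])
        if not states or states[-1] != state:
            states.append(state)  # a new qualitatively distinct interval begins
    return states


rising = qualitative_states([0.0, 1.0, 2.0, 3.0])
draining = qualitative_states([3.0, 2.0, 1.0, 0.0, 0.0])
```

The first series starts at zero while increasing and then continues increasing with a positive value, giving two states. The second mirrors a quantity that decreases to zero and then stays there.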

Before discussing the specific access process, a few representational conventions will be 
described. 

4.2.1 Behavioral Segments

We need a way to represent behaviors, potentially at multiple levels of abstraction and from
different ontological perspectives. Consider the behavior of alcohol disappearing when
left sitting in an open container (Figure 4.1). Two classes of information are recorded
(Figure 4.2): the original scenario description (e.g., (Open beaker2) and (Container-of




Figure 4.1: An unexplained observation of alcohol disappearing from an open container.

(defObservation open-alcohol :new
  Behavior open-alcohol-behavior
  Individuals (alcohol1 alcohol beaker2)
  World ((substance alcohol)
         (contained-liquid alcohol1)
         (container beaker2)
         (container-of alcohol1 beaker2)
         (substance-of alcohol1 alcohol)
         (open beaker2)
         (beaker beaker2)))

(defBSegment open-alcohol-behavior :new
  Characterizations ((matter-movement ?self)
                     (monotonic ?self)
                     (continuous-movement ?self))
  Components (alcohol-going alcohol-dry)
  Relations ((meets alcohol-going alcohol-dry)))

(defSituation alcohol-going :new
  Characterizations ((matter-movement ?self)
                     (monotonic ?self)
                     (continuous-movement ?self))
  Individuals (alcohol1)
  Dynamics ((Decreasing (Amount-of alcohol1))
            (Constant (Change-rate (Amount-of alcohol1)))
            (Greater-than (A (Amount-of alcohol1)) zero)))

(defSituation alcohol-dry :new
  Individuals (alcohol1)
  Dynamics ((Constant (Amount-of alcohol1))
            (Equal-to (A (Amount-of alcohol1)) zero)))



Figure 4.2: Behavioral description of an open container of alcohol. The defSituation form
identifies a primitive (single state) bseg. The :new keyword indicates a new observation,
as opposed to a declaration of an old experience. Old experiences have an additional
Processes field for each situation, providing pointers to instantiations of the theories used
to explain it.

alcohol1 beaker2)) and the dynamic behavior across time (e.g., (Decreasing (Amount-of
alcohol1))).

Observations are recorded using the defObservation form, which names the observation,
identifies the individuals involved, and indicates the name of its behavioral description.
The scenario description (e.g., structural relationships and objects' properties) appears as
the World component of an observation.

Behaviors are represented by collections of behavioral segments (bsegs). A behavioral
segment represents a slice through the spatial-temporal plot of an observation. A bseg
may represent either a primitive situation, or an extended period of time summarizing a
collection of more primitive bsegs. No important distinctions are made between the two
types, since what is considered primitive depends upon the information available. Added
detail will typically expand a situation into finer divisions.

Interrelations between bsegs may be either spatial or temporal in nature. For example,
consider the multiple representations of the oscillatory behavior of a spring and block
combination shown in Figure 4.3. The upper-most, single state description summarizes
the behavior with an oscillating bseg. This decomposes into the eight state cycle below
it, which is in terms of Velocity(block1) > 0, Decreasing[Velocity(block1)], and
Position(block1) > 0, etc. The eight state and single state descriptions are temporally
related, in this case through abstraction of eight temporal states into a single temporal
state. Differing spatial slices through the representation are possible as well. For example,
the eight-state cycle describing velocity and position may be divided into two four-state
cycles, one describing the velocity's behavior, the other describing the position's behavior.
Currently, PHINEAS uses only the temporal abstraction component of this representation.

Behavioral segments are recorded using the defBSegment form:

(defBSegment <BsegName> [:new]
  Characterizations <BehavioralAbstractions>
  Individuals (<Entity1> <Entity2> ... <Entityn>)
  Explanation <ExplanationSummary>
  Components <BSegs>
  Relations <TemporalSpatialRelations>
  Dynamics (<BehavioralRelations>))

Figure 4.3: Multiple temporal and spatial views of a spring-block oscillator. Each circle
represents a temporal interval during which all quantities have a constant value with respect
to the level of description in use for that view. [Figure not reproduced. Legible labels: a
single summary state with Oscillating(velocity) and Oscillating(position), related by temporal
abstraction to a cycle of states with labels such as Dec(velocity), Inc(position),
Inc(velocity), and Dec(position).]

The Characterizations field describes the behavior's abstract properties, such as
asymptotic approach. A bseg's Explanation field lists the process and entity instances that are
active during its duration. This field is only used for the declaration of past experiences, in
which these theories are taken from the accepted explanation given for the bseg. For example,
the Explanation field of an explained dissolving observation appears as:

Explanation ((dissolve pi1 ((?solute . salt1) (?solution . water1)))
             (solution ((?solution . water1))))

which indicates that the dissolving process was active, the process instance was called pi1,
and the two individuals described by dissolving were salt1 and water1. Further, the entity
declaration solution was active. The Components field identifies the bsegs it temporally or
spatially summarizes. The Relations field describes the temporal or spatial interrelations
between the components (e.g., Meets specifies temporal ordering). The Dynamics field
records the qualitative value and derivative information of the observed behavior. The
:new option indicates a new observation, as opposed to a declaration of an old experience.
Old experiences are automatically stored in memory, while new observations are the targets
for explanation.
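
As a reading aid, the fields of the defBSegment form can be mirrored in a small record type. This Python sketch is an assumption-laden paraphrase rather than PHINEAS code: the class `BSeg`, its attribute names, and the helper `is_primitive` are hypothetical, and list-of-tuples stands in for the original s-expressions.

```python
from dataclasses import dataclass, field


# Hypothetical record mirroring the defBSegment fields described above.
@dataclass
class BSeg:
    name: str
    new: bool = False                                      # the :new option
    characterizations: list = field(default_factory=list)  # behavioral abstractions
    individuals: list = field(default_factory=list)
    explanation: list = field(default_factory=list)        # only for past experiences
    components: list = field(default_factory=list)         # bsegs this one summarizes
    relations: list = field(default_factory=list)          # e.g. ("meets", a, b)
    dynamics: list = field(default_factory=list)           # qualitative values/derivatives

    def is_primitive(self):
        """A bseg with no components describes a single situation."""
        return not self.components


abstractions = ["matter-movement", "monotonic", "continuous-movement"]
alcohol_going = BSeg(
    name="alcohol-going", new=True, characterizations=list(abstractions),
    individuals=["alcohol1"],
    dynamics=[("Decreasing", ("Amount-of", "alcohol1")),
              ("Constant", ("Change-rate", ("Amount-of", "alcohol1"))),
              ("Greater-than", ("A", ("Amount-of", "alcohol1")), "zero")])
alcohol_dry = BSeg(
    name="alcohol-dry", new=True, individuals=["alcohol1"],
    dynamics=[("Constant", ("Amount-of", "alcohol1")),
              ("Equal-to", ("A", ("Amount-of", "alcohol1")), "zero")])
open_alcohol_behavior = BSeg(
    name="open-alcohol-behavior", new=True, characterizations=list(abstractions),
    components=[alcohol_going, alcohol_dry],
    relations=[("meets", "alcohol-going", "alcohol-dry")])
```

The three instances restate the alcohol observation of Figure 4.2. The primitive/summary distinction is kept only as a convenience method, since the text notes that no important distinction is made between the two kinds of bseg.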

4.2.2 Behavioral Abstractions 

We also need a way to organize behaviors in memory so that they may be retrieved later.
This involves indexing them using a number of different keys, such as the perceptual
primitives associated with them, what their gross effects are, and the domains and situations
they apply to.

Behavioral segments are indexed in memory via multiple behavioral abstraction
hierarchies. Each tree represents an isa hierarchy of abstract behavioral characterizations. They
are intended to relate those aspects of a behavior that are perceptually or analytically
available, yet not easily describable using the standard value and derivative notation of
qualitative quantities (e.g., positive, increasing, etc.).

Figure 4.4: Behavioral abstraction forest. [Figure not reproduced. Legible node labels:
Graphical-Characterizers (linear, cyclic, asymptotic-approach, sinusoidal, simple-cyclic,
approach-constant, dual-approach, dual-approaching, dual-approach-finish);
Movement-Perception (corpuscular, continuous, wavelike); movement type (matter-movement,
wave-movement, phase-change-movement, with solid, liquid, and gas phase-change nodes);
Mode-of-Force (direct-force, action-at-a-distance, push, pull, twist).]

The entire set of behavioral abstractions used in PHINEAS is shown in Figure 4.4. There
are four primary categories:

• Graphical characterizers. These may be considered either visual or graphical
characterizations. At the highest level are linear, cyclic, and asymptotic approach. The
further specializations of these are shown in Figure 4.4. For example, asymptotic
approach might describe approaching a constant, or two quantities asymptotically
approaching each other.

• Movement perception. These describe perceptual abstractions of a behavior.
Corpuscular movement describes a solid object in motion, whereas continuous movement
describes continuous transfer of matter or energy, as in liquids flowing. Corpuscular
movements may involve spinning, revolving, reflecting, bouncing, twisting, or compressing.

• Movement type. This describes the general underlying mode of movement: matter
simply moving its location or orientation, movement of a wave front through space,
or matter moving due to a change in state.

• Mode of force. This describes the source of force being used, if readily obvious to an
observer. These are action-at-a-distance, or some type of direct or contact force (e.g.,
push, pull, twist).

Each bseg is multiply indexed in memory under the set of behavioral characterizations
associated with it. For example, an observation of liquid flowing between two
containers may be found under three different abstract characteristics: matter-movement,
continuous-movement, and dual-approach-finish (Figure 4.4). It is important to note
that a bseg need not be indexed under every one of the four category types listed above,
and typically it is not. For example, the mode of force category is often not included
in a description.



4.2.3 The Access Process 

Memory consists of a library of previously observed phenomena (i.e., situation and
behavior descriptions) and a collection of qualitative theories about physical processes (e.g.,
liquid flow), entities (e.g., fluid paths), and general physical principles (e.g., mechanical
coupling).* Past reasoning traces are summarized by storing with each situation in an
observation the instantiated collection of theories (process definitions, etc.) that were used
to explain it. This enables all of the relevant information needed to explain an observation
to be linked with the observation in memory, without incurring the overhead of storing full,
detailed explanations. Were such an explanation needed, it could be easily regenerated from
the available information. For example, the statement Liquid-Flow(beaker1, vial3, pipe2)
would be stored with an observation of liquid flowing from beaker1 to vial3, indicating
an instance of the Liquid-Flow process.

The problem of access limited to exact feature match is avoided through a two stage
process. Shared abstractions are used to focus attention on a potentially relevant subset of
memory. Each experience in this subset is then compared at a detailed level to the current
situation using SME.

*PHINEAS's domain knowledge is listed in Appendix C. Its set of a priori experiences is listed in
Appendix D.

Figure 4.5: The PHINEAS access process and adjacent modules.

When a new observation is encountered, each bseg is integrated into memory via its
given behavioral abstractions. In doing so, past behaviors sharing those abstractions are
"touched", that is, their numeric activation levels are incremented. An extremely simple
activation scheme is used in PHINEAS. For each behavioral abstraction describing a newly
observed bseg, the activation of every existing bseg in memory also described by that
behavioral abstraction is incremented by a value of one. If there are no known instances of
that specific abstraction, then instances are sought under each more general node in the
hierarchy until instances are found. Increasing the level of abstraction produces greater
distance between the current situation and candidate instances. Thus, the amount of
activation added to a discovered bseg should decrease with distance from the observed
behavioral classification. In PHINEAS, the activation weight is dropped by one-tenth for each
node traversed. For example, an observation of evaporation might be classified as a type
of fluidic-phase-change-move, while a past observation of dissolving might be classified
as a type of solid-phase-change-move. Both are types of phase-change-movement.
Thus, installing evaporation under fluidic-phase-change-move would cause the activation
level of dissolving to be incremented by 0.8. An initial set of candidate analogues is
then obtained by collecting the N most activated behaviors, where N is a specified beam
size (currently 15).* These behaviors are those experiences sharing the greatest number of
abstract, characterizing features. Temporal subsumption is checked during this operation,
so that a general bseg is preferred over the set of bsegs it summarizes.
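
The activation scheme just described can be sketched in a few lines. The Python below is a hedged reconstruction from the prose, not PHINEAS's code: the function names, the `PARENT` and `INDEX` tables, and the reading that "nodes traversed" counts steps both up to the general node and down to the instance are assumptions, chosen so that the evaporation/dissolving example yields 0.8. The temporal subsumption check is omitted.

```python
# Hypothetical sketch of abstraction-based activation with a 0.1 drop per node.
def children_of(parent):
    kids = {}
    for child, par in parent.items():
        kids.setdefault(par, []).append(child)
    return kids


def instances_below(node, kids, index, depth=0):
    """Yield (bseg, depth) for bsegs indexed at node or anywhere beneath it."""
    for bseg in index.get(node, []):
        yield bseg, depth
    for child in kids.get(node, []):
        yield from instances_below(child, kids, index, depth + 1)


def touch(abstraction, parent, index, activation, decay=0.1):
    def bump(bseg, nodes_traversed):
        weight = max(0.0, 1.0 - decay * nodes_traversed)
        activation[bseg] = activation.get(bseg, 0.0) + weight

    exact = index.get(abstraction, [])
    if exact:                      # known instances of this specific abstraction
        for bseg in exact:
            bump(bseg, 0)
        return
    kids = children_of(parent)
    node, up = parent.get(abstraction), 1
    while node is not None:        # otherwise climb until instances are found
        found = list(instances_below(node, kids, index))
        if found:
            for bseg, down in found:
                bump(bseg, up + down)
            return
        node, up = parent.get(node), up + 1


def candidates(activation, beam_size=15):
    """The beam_size most activated behaviors, most activated first."""
    return sorted(activation, key=activation.get, reverse=True)[:beam_size]


PARENT = {"fluidic-phase-change-move": "phase-change-movement",
          "solid-phase-change-move": "phase-change-movement",
          "phase-change-movement": None,
          "continuous-movement": None}
INDEX = {"solid-phase-change-move": ["dissolving"],
         "continuous-movement": ["dissolving", "boiling"]}

activation = {}
for abstraction in ["fluidic-phase-change-move", "continuous-movement"]:
    touch(abstraction, PARENT, INDEX, activation)
ranked = candidates(activation)
```

Here dissolving receives 0.8 through the shared phase-change-movement ancestor plus 1.0 for the exact continuous-movement match, so it ranks ahead of boiling, which only shares continuous-movement.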

Each prior experience in the initial candidate set is then inspected more carefully by 
matching its detailed structural and behavioral description with the current situation. SME
is used for this operation, producing an initial partial mapping and an evaluation score for 
each candidate analogue experience. This partial mapping provides an indication of what 
objects and quantities correspond by virtue of their behavioral similarity and will serve as 
an important source of constraint during the mapping process. 

The match indicates where the behaviors correspond and thus what portion of the
analogue behavior should be considered relevant. The problem of relevant theory selection
is solved by retrieving only those domain theories that had been used to explain the matched
portions of the analogue situation. Each bseg indicates what processes were active during
that state. Thus, if the current observation only matches a subset of the bsegs in the
analogue observation, only the relevant process models are used.
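
Relevant theory selection, as described above, amounts to collecting the theories attached to the matched bsegs only. The following Python sketch illustrates that idea under stated assumptions: the function `relevant_theories`, the bseg names `heating` and `boiling-off`, and the contents of `explanations` are hypothetical and are not taken from PHINEAS's memory.

```python
# Hypothetical sketch: keep only the theories that explained the matched bsegs.
def relevant_theories(matched_bsegs, explanations):
    """Collect, in order and without duplicates, the theories stored with each matched bseg."""
    selected = []
    for bseg in matched_bsegs:
        for theory in explanations.get(bseg, []):
            if theory not in selected:
                selected.append(theory)
    return selected


# An invented analogue observation with two situations and their explanations.
explanations = {
    "heating": ["heat-flow"],
    "boiling-off": ["heat-flow", "boiling"],
}

partial = relevant_theories(["heating"], explanations)
full = relevant_theories(["heating", "boiling-off"], explanations)
```

When the match covers only the first situation, the boiling theory is never retrieved, which keeps the later mapping step from considering process models that played no part in the matched behavior.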

The candidates are then ordered according to SME's evaluation score and proposed one
at a time as results from the access task of PHINEAS' global agenda. If more candidates
are required beyond the initial N, the access task may always resume where it left off and
examine those remaining. However, the ability to resume a suspended access task is not
utilized at the present time.

4.3 Disappearing Alcohol Example 

The access mechanism will now be reviewed in the context of a detailed example. In this
example, PHINEAS is given time-ordered measurements of a situation in which the amount of
alcohol sitting in an open beaker is seen to continually decrease, with the beaker eventually
empty (Figure 4.1). The complete observation description appears in Figure 4.2. The
system begins with knowledge of eight processes - liquid flow, liquid drain (to constantly
empty an ideal sink), heat flow, boiling, heat-replenish (e.g., to constantly maintain the
heat of a stove), dissolve, motion, and spring-applied force. It also has a database of
physical observations fully explained by these processes.

The problem is to propose an explanation for this observation, given PHINEAS' current 
breadth of knowledge. First, the behavioral abstractions describing the observation are 
used to probe memory. In this case, there are three. First, matter-movement indicates 
that matter is moving, for the alcohol is disappearing. This directly activates two different 
liquid flow observations, one of liquid draining from a leaky cup and the other of flow 
between two containers. Second, continuous-movement indicates that the movement is 
happening gradually over the total mass of the alcohol, as opposed to the alcohol moving 
as a whole. This activates a number of experiences: a leaky cup, liquid flow between two 
containers, salt dissolving in water, and a pot of boiling water. Finally, the behavior is 
monotonic, as opposed to cyclic. This activates the leaky cup experience, dissolving, and 
boiling. Taking all of those that have a non-zero activation level (which is less than the 
beam size*), we have the leaky cup situation, salt dissolving in water, a pot of boiling water, 
and liquid flow between two containers. 

* Due to PHINEAS' currently impoverished set of experiential knowledge, the beam size of 15 is not a 
factor in any of the examples discussed in this thesis. Typically, only 3-4 analogues are retrieved. 

The second stage of access examines this subset of memory in more detail, using SME to 
establish and evaluate each match. SME produces the following observation and evaluation 
score pairs: 

BOILING-BEHAVIOR (15.7) The alcohol disappearing is similar to what would happen if 
it were boiling. This match is shown in Figure 4.6. 

LIQUID-DRAINING-BEHAVIOR (14.9) In this behavior, a leaky cup is found to correspond 
to the beaker and the water leaking out corresponds to the disappearing alcohol. 

DISSOLVE-BEHAVIOR (12.6) Salt dissolving in water behaves similarly to the disappear- 
ing alcohol. 

2-CONTAINER-LF (11.1) Water flowing out of one container and into another until pres- 
sure equilibrium is reached has a similar behavior, although it ends with liquid still 
present in the source container. 

At this point, the access task returns the boiling analogue as the best behavioral and 
structural match for the current situation and passes it to a mapping task. The remaining 
three analogues are each assigned a mapping task as well, with their match scores used 
to establish priority. The next few chapters primarily follow the development of the 
"dissolving" hypothesis, because it demonstrates the greatest percentage of ideas in the thesis. 
Chapter 8 concludes the example by discussing PHINEAS's treatment of each hypothesis and 
how it ultimately selects boiling as the best explanation. 
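
The two access stages of this example can be sketched in a few lines of Python. This is only an illustrative sketch, not PHINEAS' implementation: it assumes that an experience's activation level is simply the number of behavioral abstractions that index it, and it substitutes a lookup table of the evaluation scores reported above for an actual SME run. The names `INDEX`, `SME_SCORES`, `probe_memory`, and `rank_candidates` are hypothetical.

```python
from collections import Counter

# Experiences activated by each behavioral abstraction (taken from the example).
INDEX = {
    "matter-movement": ["leaky-cup", "2-container-lf"],
    "continuous-movement": ["leaky-cup", "2-container-lf", "dissolve", "boiling"],
    "monotonic": ["leaky-cup", "dissolve", "boiling"],
}

# Evaluation scores reported in the example; a stand-in for running SME.
SME_SCORES = {"boiling": 15.7, "leaky-cup": 14.9, "dissolve": 12.6, "2-container-lf": 11.1}


def probe_memory(abstractions, index, beam_size=15):
    """Stage 1: keep experiences with non-zero activation, at most beam_size of them."""
    activation = Counter()
    for abstraction in abstractions:
        # Assumption: each abstraction adds one unit of activation per experience.
        activation.update(index.get(abstraction, []))
    return [name for name, _ in activation.most_common(beam_size)]


def rank_candidates(candidates, score):
    """Stage 2: order the candidates by match score, best first."""
    return sorted(candidates, key=score, reverse=True)


candidates = probe_memory(list(INDEX), INDEX)
agenda = rank_candidates(candidates, SME_SCORES.get)
print(agenda)
```

The first element of `agenda` is the analogue handed to the first mapping task; the remaining elements receive mapping tasks in decreasing priority.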

4.4 Perspective 

Access is a difficult problem. Part of the difficulty is methodological, in that it is hard to 
build programs with the same volume and variety of experience we receive. 
Boiling water (base)

    (Situation boiling-bseg
      (Set (Constant (Temperature stove9))
           (Constant (Temperature bwater1))
           (Equal-to (A (Temperature stove9))
                     (A (Temperature bwater1)))
           (Greater-than (A (Amount-of bwater1)) zero)
           (Constant (Change-Rate (Amount-of bwater1)))
           (Constant (Change-Rate (Amount-of bsteam1)))
           (Decreasing (Amount-of bwater1))
           (Increasing (Amount-of bsteam1))))

    (Situation boiling-dry
      (Set (Constant (Amount-of bwater1))
           (Equal-to (A (Amount-of bwater1)) zero)
           (Greater-than (A (Amount-of bsteam1)) zero)))

    (Meets situation1 situation2)

Disappearing Alcohol History (target)

    (Situation alcohol-going
      (Set (Decreasing (Amount-of alcohol1))
           (Constant (Change-Rate (Amount-of alcohol1)))
           (Greater-than (A (Amount-of alcohol1)) zero)))

    (Situation alcohol-dry
      (Set (Constant (Amount-of alcohol1))
           (Equal-to (A (Amount-of alcohol1)) zero)))

    (Meets situation1 situation2)

Entity & Quantity Correspondences

    bwater1       <->  alcohol1
    pan7          <->  beaker2
    water         <->  alcohol
    Amount-of     <->  Amount-of
    Change-Rate   <->  Change-Rate
    boiling-bseg  <->  alcohol-going
    boiling-dry   <->  alcohol-dry

Figure 4.6: SME_CSM match between the disappearing alcohol and a boiling pan of water. 




Several factors have been left out of PHINEAS that seem important. First, the more novel 
a situation and the less readily visible it is, the harder recognition becomes. Data must be 
collected and analyzed, patterns sought, and overall familiarity increased. At some point, 
enough details may be in place to trigger recognition, reorganization, and insight 
(Dreistadt, 1968). In PHINEAS, the access task is greatly simplified, with enough information 
provided by the user to find relevant candidate analogues. For example, the fundamental clue 
leading to the caloric theory of heat, namely that the temperatures reach equality 
(Roller, 1961; Wiser & Carey, 1983), is part of the behavioral description given to PHINEAS. 
Currently, there is no mechanism in PHINEAS to model this gradual buildup of experiences. See 
(Dreistadt, 1968, 1969) for a discussion of this phenomenon in human reasoning, and 
(Langley & Jones, 1988) for an initial computational framework. 

Figure 4.7: Two views of the moon orbiting the earth. 

A second problem is the translation of raw input data to qualitative patterns (e.g., 
sinusoidal oscillation). PHINEAS has the ability (using DeCoste's (1989) measurement 
interpretation system) to translate a continuous, real valued measurement sequence into a 
succession of equivalent states capturing each quantity's qualitative value and derivative. 
However, it does not have the ability to automatically analyze the data and identify the 
appropriate behavioral abstractions (e.g., sinusoidal oscillation). For this reason, actual 
real-valued measurement sequences are not currently used. At some point this should be 
automated. 

Just as important as behavioral abstraction recognition is behavioral abstraction devel- 
opment. The abstractions used in PHINEAS are theory laden and greatly simplify memory 
indexing. What is needed is a richer vocabulary for describing behaviors and situations. 

Finally, the ability to change perspective on a set of data appears crucial to successful 
scientific explanation. For example, consider attempting to explain the moon's motion 
around the earth. Different perspectives can lead to different answers. In Figure 4.7(a), 
the moon is viewed from a polar coordinate system, which supports an analogy with a ball 
spun about on a string. However, a cartesian view of the situation, with x and y coordinates 
oscillating (Figure 4.7(b)), supports a (rather odd) analogy with a pair of spring systems.* 
PHINEAS currently does not reason about multiple perspectives of a situation, a difficult 
modeling problem in general (see (Falkenhainer & Forbus, 1988) for initial work in this 
area). 

* I owe this double-spring possibility to John Collins. 
Chapter 5 

Mapping and Transfer 



Mapping and transfer form the center of the analogy process. Mapping represents the 
primary analogical act: establish a complete set of correspondences and propose any infer- 
ences these correspondences sanction. Transfer represents the first evaluation phase for the 
results of mapping: attempt to transfer the inferences mapping proposes into the target 
domain. This chapter discusses these two intimately related processes. It begins by fram- 
ing their respective roles in the larger analogy process and how they may interact. It then 
discusses each, along with a detailed example demonstrating their operation. 

5.1 Overview 

The term "mapping" used in reference to the correspondence phase of analogy has its 
origins in formal mathematics and may be defined as follows: 

Definition 5.1 (Analogical Mapping) Take B to represent the set of individual items 
from the base domain description (i.e., predicates, expressions, and entities), and T to 
represent the set of individual items from the target domain description. An analogical 
mapping, M: B -> T, is a partial function that associates elements of B to elements of T.¹ 

Thus, an analogical mapping identifies a correspondence between some elements of 
the base domain description and elements of the target domain description. This aspect of 
mapping is sometimes called its match component. The mapping may also sanction a set of 
candidate inferences: base information that may plausibly be applied to the target domain. 
This aspect of mapping is sometimes called its carryover component (Gentner, 1988). The 
use of the term "carryover" is perhaps unfortunate, for it seems to imply that it results in 
actual, usable target inferences. However, rather than replace the term, I will afford it a 
more precise meaning: 

¹ This is equivalent to definitions appearing in (Gentner, 1983; Indurkhya, 1987). 

Definition 5.2 Given I_B, the set of base information proposed as inferable for the target, 
the carryover operation produces the set of candidate inferences CI sanctioned by mapping 
by applying the existing mapping function to I_B: 

    CI = M(I_B) 

Candidate inferences are hypotheses which must pass a series of evaluative processes 
before being accepted as holding for the target domain. A distinction may be made between 
usable inferences and useful or adopted inferences. Usable inferences are operational, that is, 
they apply predicates in a manner consistent with their intended semantics, they produce 
no immediate contradictions, and there are no syntactic holes (i.e., unknown objects are 
either found or conjectured). Useful or adopted inferences have been tested through further 
evaluation or problem solving and found to be "satisfactory" under some criterion. 

Importantly, the set of candidate inferences proposed by mapping need not be usable. 
Thus, the first stage in the evaluative processes is transfer, which attempts to form a 
working hypothesis from a potentially fragmented candidate inference. Significant adaptation 
may be required before a candidate inference represents usable target domain information. 
First, since base relations are being imported, allowing cross-domain analogies means they 
may apply predicates to objects or propositions other than their conventional referents. 
Additionally, the inferences may suggest facts that contradict what is explicitly known 
about the target. Substitutes for such expressions must be found or created. Second, these 
inferences may also contain unknown, anticipated objects: slots occupied in the base 
representation by objects that had no correspondent in the match. A corresponding target 
object must be found, or the existence of the unknown object postulated. 
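
The carryover operation of Definition 5.2, together with the unknown objects just described, can be illustrated with a short Python sketch. It is a simplification under stated assumptions rather than SME's mechanism: expressions are nested tuples, names in head position are carried over unchanged, and any base entity without a correspondent is wrapped in a `:skolem` marker. The function `carry_over` and the `constants` parameter are hypothetical names.

```python
def carry_over(expr, mapping, constants=frozenset({"zero"})):
    """Apply the mapping M to a base expression, marking unmapped entities as skolems."""
    if isinstance(expr, tuple):
        head, *args = expr
        # Assumption: predicate and function names are carried over unchanged.
        return (head, *(carry_over(arg, mapping, constants) for arg in args))
    if expr in mapping:
        return mapping[expr]
    if expr in constants:
        return expr
    # No correspondent in the match: an unknown, anticipated object.
    return (":skolem", expr)


M = {"salt1": "alcohol1"}
base_info = [
    ("Immersed-In", "salt1", "water1"),
    ("Greater-Than", ("A", ("Amount-Of", "salt1")), "zero"),
]
candidate_inferences = [carry_over(e, M) for e in base_info]
print(candidate_inferences)
```

Transfer would then have to resolve each `(":skolem", ...)` slot, either by finding a target object or by postulating one.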

An overview of the mapping and transfer phases is shown in Figure 5.1. 

In addition to base and target descriptions, the mapping box allows contextual knowledge, 
if supplied, to influence its operation. Contextual knowledge may include the current 
plans and goals of the performance element or any associations already established. These 
associations may arise either from teacher supplied hints or prior processing, such as access. 

There may be strong interplay between mapping and attempts during transfer to form 
a coherent conclusion from its statement of correspondence. This is in part due to the 
non-monotonic binding problem discussed in Chapter 2. In realistic memories, mapping will 
be operating on a subset of all that is inferrably known about the base and target. Since 
a candidate inference is with respect to the subset of the base or target being processed, 
it need not represent new knowledge. Rather, it may indicate places where additional 
knowledge should be retrieved to help complete the mapping. 

Figure 5.1: Overview of the mapping and transfer phases of the analogy process. 

Transfer conducts focused probes into memory to seek more information about places 
where the similarity match was incomplete. If additional knowledge is found, transfer will 
augment the existing base and target descriptions and iterate back to mapping, to see if 
the new information will affect the overall mapping. Mapping and transfer combine to 
form a map and analyze cycle to provide focus to the analogy process. Were everything 
possibly known about the base and target retrieved prior to mapping, a great deal of time 
might be spent on unconstrained inferencing. Conversely, were no match performed and 
relevant base information simply carried into the target, little focus would be available 
for transforming the candidate inference into useful target information. By initiating the 
match on what appears to be relevant from immediately available information, transfer 
may focus on seeking the specific information the mapping indicates is lacking. 

Typically, mapping and transfer will proceed in a simple sequential manner. In within- 
domain analogies, transfer may provide little service beyond blessing the proposed infer- 
ences as usable. In cross-domain analogies, particularly when very little is known about 
the target domain, transfer may require many map and analyze cycles. 
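
The map and analyze cycle can be pictured as a simple fixed-point loop. The Python sketch below is a toy illustration only: `toy_match` pairs facts purely by predicate name, `toy_probe` looks up missing predicates in a small dictionary standing in for memory, and the facts themselves are invented for the illustration. None of these names or data come from PHINEAS.

```python
def map_and_analyze(base, target, match, probe, max_cycles=10):
    """Alternate mapping and focused memory probes until nothing new is retrieved."""
    target = set(target)
    mapping = None
    for _ in range(max_cycles):
        mapping, gaps = match(base, target)
        new_facts = probe(gaps) - target
        if not new_facts:
            break  # transfer found nothing further; the mapping stands
        target |= new_facts  # augment the target description and re-map
    return mapping, target


def toy_match(base, target):
    """Toy matcher: predicates present in both descriptions correspond."""
    target_preds = {fact[0] for fact in target}
    mapping = {fact[0] for fact in base if fact[0] in target_preds}
    gaps = {fact[0] for fact in base} - mapping
    return mapping, gaps


MEMORY = {"Open": ("Open", "beaker2")}  # hypothetical retrievable knowledge


def toy_probe(gaps):
    """Toy transfer probe: fetch target facts for predicates the match lacked."""
    return {MEMORY[pred] for pred in gaps if pred in MEMORY}


base = {("Container", "cup1"), ("Open", "cup1"), ("Leaky", "cup1")}
target = {("Container", "beaker2")}
mapping, enriched = map_and_analyze(base, target, toy_match, toy_probe)
print(mapping, enriched)
```

In this toy run the first cycle retrieves one additional target fact, the second cycle retrieves nothing, and the loop stops, which mirrors the simple sequential behavior expected of within-domain analogies.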
5.2 Contextual Structure-Mapping 



Due to the importance of mapping in elaborating the various aspects of the analogy process, 
contextual structure-mapping was discussed in Chapter 2. In this chapter, it is placed 
within PHINEAS' overall process by way of example. 

5.2.1 Example: Disappearing Alcohol 

The second stage in PHINEAS is theory generation: produce a fully operational initial 
hypothesis about the current observation from an analogue retrieved during access. This has 
two components, mapping and transfer, of which the first will be demonstrated here. 
The mapping operation in PHINEAS consists of three steps: 

1. Determine base and target representations. 

2. Declare sanctioned pairings. 

3. Invoke SME. 

Before mapping may commence, we must determine the base and target inputs to 
mapping. Recall that past reasoning traces are summarized by storing with each bseg 
in an observation the instantiated collection of theories (process definitions, etc.) that 
were used to explain it. The problem of knowing what to map is solved by retrieving the 
relevant domain theory which led to prior understanding of the base observation. If the 
current observation only matches a subset of the bsegs in the old observation, only the 
relevant theories for that subset are used as the base description.² The current observation 
description (the target) consists solely of its original scenario description (e.g., structural 
relationships and objects' properties). 

For example, the previous chapter described how an observation of alcohol disappearing 
from an open container triggered a reminding of four similar experiences in the following 
order: liquid-draining-behavior, boiling-behavior, dissolve-behavior, and 2-container-lf. 
Resuming that example with the dissolving analogue, we find that the two behaviors fully 
matched, so both dissolving bsegs are relevant.³ The first dissolving bseg, dissolving, 
described the disappearance of salt1 and was explained by a solution entity and an active 
dissolving process. The second bseg, dissolve-stopped, described a state with no salt left 
in the glass and dissolving inactive. This explanation summary is represented using the 
B-Explains(theories, bseg) form. Both base and target representations for this example are 
shown in Figure 5.2.⁴ Two types of forms appear as elements of theories: 

    (Process-Definition process-name instance-name (Implies conditions effects)) 
    (Packet-Definition header body) 

The Packet-Definition is used to represent a packet of information (e.g., predicate schema, 
QP theory defEntity). Objects for which the packet header holds have the properties listed 
in the body ascribed to them. 

² If PHINEAS supported spatial decomposition of behaviors, selection could be further specialized. Rather 
than all information about a temporal interval, information about specific behavioral aspects of that 
temporal interval could be selected. 

³ "Fully matched" means that for each bseg in the base behavior, there is a unique bseg in the target 
behavior that SME found to possess similar aspects. Recall that due to relational groups, two bsegs need 
not exhaustively match on all relations. 

The second step to theory mapping explicitly declares the partial mapping determined 
during access. Access established that certain properties from the two situations behave 
in the same way, and correspondences between entities or between their quantities (e.g., 
Pressure and Temperature) were noted. Prior to invoking SME, each of these correspondences 
is declared as a sanctioned pairing, requiring SME's match rules to respect these 
established pairings. In the disappearing alcohol example, there are two: salt1 maps to 
alcohol1 and amount-of maps to amount-of. 
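
The effect of sanctioned pairings on the matcher can be sketched as a simple filter over proposed pairings. This Python fragment is an illustration only, not SME's match rules: it assumes a sanctioned item may pair only with its declared partner, and, as the chapter notes later, that items left unmatched by access may still pair with other unmatched items. The names `SANCTIONED` and `pairing_allowed` are hypothetical.

```python
# The two sanctioned pairings of the disappearing alcohol example.
SANCTIONED = {"salt1": "alcohol1", "amount-of": "amount-of"}


def pairing_allowed(base_item, target_item, sanctioned=SANCTIONED):
    """Return True if pairing base_item with target_item respects the sanctioned pairings."""
    if base_item in sanctioned:
        # A sanctioned base item may only map to its declared partner.
        return sanctioned[base_item] == target_item
    # An unsanctioned base item may not claim an already sanctioned target item.
    return target_item not in sanctioned.values()


print(pairing_allowed("salt1", "alcohol1"))   # respects the declared pairing
print(pairing_allowed("salt1", "beaker2"))    # contradicts it
print(pairing_allowed("water1", "beaker2"))   # both items were left unmatched by access
```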

The final step to theory mapping is the actual mapping. SME is given the base and 
target descriptions and the set of sanctioned pairings. Additionally, the SME_CSM rule file 
contains the provides relevant inference rule, which favors mappings supporting the 
inference B-Explains(theories, target-behavior). SME's output is shown in Figure 5.3. The initial 
explanation of the disappearing alcohol observation appears in the candidate inferences 
field. This model states that there is some process analogous to dissolve which is 
removing the alcohol at a rate proportional to its surface area. Note that the model is not 
operational at this stage. First, it contains the unknown (:skolem water1), which 
indicates there is no object in the alcohol scenario corresponding to water1 in the dissolving 
scenario.⁵ Additionally, it proposes the condition (Solid alcohol1) on the alcohol, which 
clearly is false. Evaluation and adaptation are required before the model will be usable. 

Base: Dissolving explanation 

    (B-EXPLAINS
      (SET (PACKET-DEFINITION (SOLUTION WATER1)
             (SET (QUANTITY (CONCENTRATION WATER1))
                  (QUANTITY (SATURATION-POINT WATER1))
                  (NOT (LESS-THAN (A (CONCENTRATION WATER1)) ZERO))
                  (NOT (LESS-THAN (A (SATURATION-POINT WATER1)) ZERO))))
           (PROCESS-DEFINITION DISSOLVE PI1
             (IMPLIES
               (AND (INDIVIDUAL SALT1 (CONDITIONS (SOLID SALT1) (SOLUBLE SALT1)))
                    (INDIVIDUAL WATER1
                      (CONDITIONS (SOLUTION WATER1)
                                  (IMMERSED-IN SALT1 WATER1)))
                    (SOLUBLE-IN SALT1 WATER1)
                    (GREATER-THAN (A (AMOUNT-OF SALT1)) ZERO))
               (AND (QUANTITY (DISSOLVE-RATE PI1))
                    (QPROP (DISSOLVE-RATE PI1) (SURFACE-AREA SALT1))
                    (GREATER-THAN (A (DISSOLVE-RATE PI1)) ZERO)
                    (CTRANS (AMOUNT-OF SALT1) (CONCENTRATION WATER1)
                            (A (DISSOLVE-RATE PI1)))))))
      DISSOLVE-BEHAVIOR)

Target: Disappearing alcohol scenario 

    (SUBSTANCE ALCOHOL)
    (CONTAINED-LIQUID ALCOHOL1)
    (CONTAINER BEAKER2)
    (CONTAINER-OF ALCOHOL1 BEAKER2)
    (SUBSTANCE-OF ALCOHOL1 ALCOHOL)
    (OPEN BEAKER2)
    (BEAKER BEAKER2)

Figure 5.2: Base and target representations in initial theory mapping of the dissolving 
explanation for the disappearing alcohol observation. 

    SME Version 2E

    Analogical Match from DISSOLVE-BEHAVIOR-THEORY to OPEN-ALCOHOL-BEHAVIOR-THEORY.
    Rule File: [illegible].rules

    # MH's | # Gmaps | 1st, 2nd, Worst      | RelGroups
    2      | 1       | 0.99 / 0.99 / 0.99   | ACTIVE

    Total Run Time: 0 Minutes, 0.509 Seconds
    BMS Run Time:   0 Minutes, 0.032 Seconds

    Best Gmaps: { 1 }

    Gmap #1: (SALT1 ALCOHOL1) (AMOUNT-OF-299 AMOUNT-OF-320)
      Weight: 0.9920
      Candidate Inferences:
        (B-EXPLAINS
          (SET (PACKET-DEFINITION (SOLUTION (:SKOLEM WATER1))
                 (SET (QUANTITY (CONCENTRATION (:SKOLEM WATER1)))
                      (QUANTITY (SATURATION-POINT (:SKOLEM WATER1)))
                      (NOT (LESS-THAN (A (CONCENTRATION (:SKOLEM WATER1))) ZERO))
                      (NOT (LESS-THAN (A (SATURATION-POINT (:SKOLEM WATER1)))
                                      ZERO))))
               (PROCESS-DEFINITION (:SKOLEM DISSOLVE) (:SKOLEM PI1)
                 (IMPLIES
                   (AND (INDIVIDUAL ALCOHOL1
                          (CONDITIONS (SOLID ALCOHOL1) (SOLUBLE ALCOHOL1)))
                        (INDIVIDUAL (:SKOLEM WATER1)
                          (CONDITIONS (SOLUTION (:SKOLEM WATER1))
                                      (IMMERSED-IN ALCOHOL1 (:SKOLEM WATER1))))
                        (SOLUBLE-IN ALCOHOL1 (:SKOLEM WATER1))
                        (GREATER-THAN (A AMOUNT-OF-320) ZERO))
                   (AND (QUANTITY (DISSOLVE-RATE (:SKOLEM PI1)))
                        (QPROP (DISSOLVE-RATE (:SKOLEM PI1)) (SURFACE-AREA ALCOHOL1))
                        (GREATER-THAN (A (DISSOLVE-RATE (:SKOLEM PI1))) ZERO)
                        (CTRANS AMOUNT-OF-320 (CONCENTRATION (:SKOLEM WATER1))
                                (A (DISSOLVE-RATE (:SKOLEM PI1))))))))
          OPEN-ALCOHOL-BEHAVIOR)

Figure 5.3: SME_CSM output mapping dissolving theories into the disappearing alcohol 
situation. An analogically inferred candidate model of the alcohol "dissolving" appears in the 
candidate inference slot of the gmap. 

This example demonstrates several points. First, the mapping is composed almost 
entirely of candidate inferences, since the system had no prior model of evaporation. Hence, 
the model was constructed by analogy rather than augmented by analogy. This shows the 
power of SME's candidate inference mechanism. Second, the entity and function 
correspondences provided by the behavioral similarity provide significant constraint for carrying over 
the explanation. SME's rule-based architecture is critical to this operation: the SME_CSM rules 
only allow hypotheses consistent with the specific entity and quantity correspondences 
previously established. Entities and quantities left without a match after the accessing stage 
are still allowed to match other unmatched entities and quantities. 

⁴ The base representation contains the expression (Qprop quantity1 quantity2). This is QP theory 
syntax indicating that quantity1 is qualitatively proportional to quantity2. All else being equal, quantity1 
increases when quantity2 increases and decreases when quantity2 decreases. Qprop- indicates inversely 
proportional to. 

⁵ The other skolem objects appearing in the candidate inference are expected by PHINEAS and 
handled by another mechanism. For example, when a new process is being conjectured, there will not be a 
correspondent for the process name or process instance name. 

5.3 Transfer 

The evaluative processes have several phases. The first of these is transfer. Transfer is 
concerned with importing candidate inferences into the target domain and making them 
operational. This centers around two issues: (1) ensuring consistent expression use and (2) 
resolving skolem objects. Since creating new predicates and conjecturing unknown objects 
is undesirable, effort is aimed at searching through memory to find possible resolutions 
of these points of conflict. Objects satisfying an unknown's stated conditions yet absent 
from the original target description may be found. Alternate beliefs or predicates may 
be found as substitutes for inconsistent expressions, which may in turn lead to previously 
unconsidered objects satisfying these alternate conditions. If new information is found, 
mapping is repeated to see how it affects the overall mapping. It could lead to a new 
mapping as opposed to an extension (cf. the N-M binding problem). 

This section begins by discussing methods that address specific aspects of the expression 
consistency and skolem object issues. It then uses these as primitive operations in describing 
the transfer process as a whole and how it may interact with mapping. It concludes with 
a continuation of the disappearing alcohol example to demonstrate the transfer procedure. 

5.3.1 Inference engines and abductive retrieval 

PHINEAS' domain knowledge, beliefs, and inference rules are maintained using an 
assumption-based truth maintenance system (ATMS) (de Kleer, 1986a) and an associated problem 
solver (ATRE), using the problem-solver protocol described by de Kleer (1986b, 1986a).⁶ 
The ATMS is crucial to PHINEAS' operation, as it allows simultaneous consideration of 
multiple, mutually inconsistent theories about the world. In an ATMS, a context is a set of 
assumptions combined with the set of all beliefs derivable from those assumptions. When 
PHINEAS is first initialized, its a priori domain knowledge is taken to be universally true. 
When a new theory is proposed, all of its consequents are made dependent upon the 
assumption that the theory holds. Reasoning about the consequences of a particular theory 
T_i is performed by reasoning in the context of the assumption Holds(T_i). 

⁶ The ATMS and associated rule engine, ATRE, used in PHINEAS are enhanced versions of programs written 
by Ken Forbus for his Building Problem Solvers course. 

In support of the various transfer operations, the forward chaining ATRE rule system 
is paired with an abductive retriever. This is a backward chaining, breadth-first prover 
akin to the deductive retriever described in (Charniak et al., 1980), with one important 
difference. Abductive assumptions are allowed when deductive reasoning over existing 
beliefs is insufficient to derive a specified request pattern. Stated formally, the abductive 
retriever's task given wff p is to find a substitution θ associating all variables in p to known 
entities such that there exists an ATMS context in which (Consistent pθ) is believed. 

When a new datum is entered in the database, ATRE exhaustively runs all rules made 
executable by the datum's presence in a forward manner. Thus, if a proposition is not 
believed true or false, its status is not derivable from existing beliefs. When the abductive 
retriever is invoked on a request pattern, it alternates between checking the belief status of 
the current goal node and backward chaining if the status of that goal node is unknown. 
Two operations remove the apparent redundancy with ATRE. First, if all subgoal attempts 
to show a current goal fail, that goal is assumed true if it is consistent to do so. A goal may 
only be assumed if it is a ground wff (i.e., no variables). Logical consistency is constrained 
by inference rules, the typed logic, and by closed world assumptions (Reiter, 1978). For 
example, all spatial relationships, such as Touching, are assumed to be either known 
or false. Second, if only a subset of a conjunctive goal's elements may be shown, the 
remaining conjuncts are assumed, contingent on their joint consistency (i.e., (Consistent 
C1 ∧ ... ∧ CN) is true, where {C1 ∧ ... ∧ CN} denotes the elements of a conjunctive goal of 
size N). Assumptions, explanation dependencies, and determination of joint consistency 
are handled through normal ATMS operations. For example, given the goal to show 

    P(?x) ∧ Q(?x) ∧ R(?x) 

with P(a), Q(b), and R(b) believed true, the abductive retriever would return two 
possibilities: 

    Binding 
    1: (?x . a) 
    2: (?x . b) 

Such "assume if consistent" is clearly inadequate as a general theory of abduction, but it 
has proved adequate for PHINEAS (see also (Charniak, 1988) for a similar approach).⁷ A 
depth bound (default 4) is used to limit rule chaining depth. Monitors are used to return 
sets of answers one at a time⁸ and control communication between goals and subgoals 
(they are derived from Charniak et al.'s (1980) generators). Coordination of conjunctive 
subgoals to prevent the N-M binding problem is a somewhat complicated process astray of 
the primary topic of this thesis. The details are described in Appendix A. 
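
The "assume if consistent" treatment of conjunctive goals can be illustrated with a deliberately small Python sketch. It is not the abductive retriever itself: there is no backward chaining, depth bound, monitor, or ATMS, consistency is reduced to a lookup in a set of known-false propositions, and a binding is proposed only when at least one conjunct is already believed for that entity. That last restriction and the names `abduce` and `known_false` are assumptions made for the illustration.

```python
def abduce(conjuncts, entities, believed, known_false=frozenset()):
    """Return (binding, assumptions) pairs for a conjunctive goal over one variable ?x."""
    results = []
    for entity in entities:
        ground = [(pred, entity) for pred in conjuncts]
        shown = [goal for goal in ground if goal in believed]
        missing = [goal for goal in ground if goal not in believed]
        if not shown:
            continue  # assumption of this sketch: some conjunct must already hold
        if any(goal in known_false for goal in missing):
            continue  # assuming the remaining conjuncts would be inconsistent
        results.append(({"?x": entity}, missing))
    return results


believed = {("P", "a"), ("Q", "b"), ("R", "b")}
answers = abduce(["P", "Q", "R"], ["a", "b"], believed)
for binding, assumptions in answers:
    print(binding, "assuming", assumptions)
```

Under these assumptions the sketch yields the two bindings of the example above: ?x bound to a with Q(a) and R(a) assumed, and ?x bound to b with P(b) assumed.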

5.3.2 Expression consistency 

Candidate inferences represent base knowledge applied to target referents. However, 
vocabulary that was appropriate for the base may not be appropriate for the target case, either 
due to semantic changes in crossing domains or alternate characteristics of the situations. 
Therefore, each expression proposed as a candidate inference about the target is checked for 
consistency in the target environment and adapted if necessary. Three operations support 
this: show consistency, retrieve alternate expression, and create predicate. 

5.3.2.1 Show consistent 

Semantic applicability of a predicate may be evaluated by testing if the proposed target 
instantiation is consistent with the type constraints declared for its arguments (Burstein, 1983). 
However, sorted logic alone is insufficient to maintain consistency. The new predicate 
instance may violate what is known about things described by that predicate. There isn't 
much that can be said about the single argument to Fluid-Path except that it must be a 
physical object. Yet, we know a solid metal bar should not be considered a Fluid-Path, an 
inference proposed if heat flowing through the bar is explained via analogy to liquid flow. To 
properly tell if the proposed instantiation has the requisite properties, it is necessary to test 
the expression itself for consistency with existing knowledge. Thus, the show-consistent 
operation uses three tests to determine consistency: (1) each argument is consistent with 
the declared type of the position it appears in, (2) the proposition is not assumably false 
by closed-world assumption, and (3) the proposition's negation is not provably true. The 
show-consistent operation is often denoted by the goal to show 

    (Consistent expression) 

⁷ As an aside, I would highly recommend temporary installation of such a mechanism into any large 
reasoning system. Holes in one's "complete" domain theory become amusingly clear when wild assumptions 
start being made based on what the domain theory deems consistent. 

⁸ If a direct fetch on the database returns multiple possible instantiations, these possibilities are returned 
as a set rather than one at a time. 
When show-consistent is given an expression containing variables, only argument 
type consistency for ground arguments may be determined (PHINEAS doesn't support pure 
quantified reasoning). The other two consistency tests are not attempted. For example, 
(Immersed-in alcohol1 ?x) is inconsistent for any instantiation of ?x, since alcohol1 
occupies a slot limited to solids. 
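
The three tests behind show-consistent, and the reduced check applied to expressions with variables, can be sketched as follows. This Python fragment is an assumption-laden illustration: the type declarations, the closed-world predicate set, and the belief sets are toy data, and "provably true" is approximated by membership in a set of negated propositions. All names here (`ARG_TYPES`, `ENTITY_TYPES`, `CLOSED_WORLD`, `BELIEVED`, `NEGATED`, `show_consistent`) are hypothetical.

```python
# Toy declarations; only the "solids only" first slot of Immersed-In comes from the text.
ARG_TYPES = {
    "Immersed-In": [{"solid"}, {"solid", "liquid"}],
    "Touching": [{"solid", "liquid"}, {"solid", "liquid"}],
    "Solid": [{"solid", "liquid"}],
}
ENTITY_TYPES = {"salt1": "solid", "alcohol1": "liquid", "water1": "liquid"}
CLOSED_WORLD = {"Touching"}                  # predicates assumed either known or false
BELIEVED = {("Immersed-In", "salt1", "water1")}
NEGATED = {("Solid", "alcohol1")}            # propositions whose negation is believed


def is_variable(term):
    return term.startswith("?")


def show_consistent(expr):
    """Apply the three consistency tests to a proposition given as a tuple."""
    pred, *args = expr
    # Test 1: every ground argument fits the declared type of its position.
    for arg, allowed in zip(args, ARG_TYPES[pred]):
        if not is_variable(arg) and ENTITY_TYPES[arg] not in allowed:
            return False
    if any(is_variable(arg) for arg in args):
        return True  # with variables present, only the type test is attempted
    # Test 2: not assumably false by closed-world assumption.
    if pred in CLOSED_WORLD and expr not in BELIEVED:
        return False
    # Test 3: the negation is not (in this sketch) known to hold.
    return expr not in NEGATED


print(show_consistent(("Immersed-In", "alcohol1", "?x")))
print(show_consistent(("Immersed-In", "salt1", "?x")))
```

The first call fails the type test for any binding of ?x, matching the example in the text, while the second passes because only the ground argument is checked.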

A proposed expression will be called singularly consistent if it is consistent with existing 
knowledge when considered in isolation.⁹ This is distinguished from global consistency, 
which examines the interrelations of the entire inference set and its ability to provide 
useful and coherent problem solving performance. Global consistency is the province of 
post-transfer evaluation, and is discussed in the next chapter. 

5.3.2.2 Retrieve alternate expression 

Singular inconsistency offers two options. A new predicate may be created or an 
alternate target correspondent sought. Of course, the latter is preferable to the former since 
it prevents unconstrained spawning of new vocabulary and provides access to knowledge 
attached to a known relation. Thus, singular inconsistency establishes the goal of seeking 
an alternate predicate that may fulfill the intent of the original base expression and 
form an expression that is either true or consistently assumable. This operation is called 
retrieve-alternate-expression and has two components, corresponding to the two 
methods of mapping non-identical predicates: paired predicates are functionally analogous 
or they represent a minimal ascension of the predicate hierarchy. 

A functionally analogous expression is found by determining why the base expression 
was present in the base description and searching for a target expression that can provide 
the same service. The cache slot of the base's process description supplies the necessary 
information by stating what consequent prerequisites each antecedent satisfies: 

(Satisfies antecedent (Prereq consequent prerequisite)) 

The general prerequisites the expression's base correspondent satisfied are retrieved and 
mapped to the target by application of the existing mapping function. Depth-bound, 
exhaustive backchaining is then used to locate all known target propositions deductively 
supporting any of these prerequisites. For each one found, the expression 

(Supports target-proposition prerequisite) 

is returned. Additionally, a supports proposition for the original base version of this 
expression is returned. 

^Greiner (1988) also uses general consistency to test aptness of an inference, but as a substitute for 
mapping to select target correspondents. Mapping consists of attempting to reinstantiate a base abstraction 
in the target environment, without further reference to the base instance. I prefer to attempt to use base 
vocabulary, as this eliminates a lot of needless search if singularly consistent (typical of within-domain 
analogies) and provides a strong clue when searching for an alternative. 
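The search for supporting propositions described above can be sketched as follows. This is a minimal illustration in Python, not the PHINEAS implementation: the rule table, the `supporters` function, and the ground (variable-free) facts are all assumptions made for the example, and "known" is taken to include derivable facts.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

Fact = Tuple[str, ...]
# Hypothetical rule table: consequent -> alternative antecedent sets deriving it.
Rules = Dict[Fact, List[FrozenSet[Fact]]]


def supporters(goal: Fact, rules: Rules, known: Set[Fact], depth: int) -> Set[Fact]:
    """Collect known facts that support `goal` within `depth` rule applications."""
    found: Set[Fact] = set()
    if depth == 0:          # depth bound reached: stop chaining
        return found
    for antecedents in rules.get(goal, []):
        for ante in antecedents:
            if ante in known:
                found.add(ante)
            # Keep chaining backward: an antecedent may itself rest on known facts.
            found |= supporters(ante, rules, known, depth - 1)
    return found


# Toy target knowledge (names loosely follow the disappearing-alcohol example).
PATH: Fact = ("Physical-Path", "alcohol1", "atmosphere")
TOUCHING: Fact = ("Touching", "alcohol1", "atmosphere")
OPEN: Fact = ("Open", "beaker2")
RULES: Rules = {PATH: [frozenset({TOUCHING})], TOUCHING: [frozenset({OPEN})]}
KNOWN: Set[Fact] = {TOUCHING, OPEN}

# Each supporter would be reported as (Supports <fact> <prerequisite>).
for fact in sorted(supporters(PATH, RULES, KNOWN, depth=2)):
    print(("Supports", fact, PATH))
```

The depth bound is what keeps the exhaustive search finite, including on cyclic rule sets; a smaller bound simply returns fewer supporters.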

If a functionally analogous expression cannot be found, then the predicate hierarchy 
is searched for a nearest neighbor. This is the minimal ascension operation. The ISA 
hierarchy is climbed until a predicate is found having argument type constraints consistent 
with their proposed instantiation.^^ All predicates below this point are then examined. If 
consistent instantiations exist among these descendants, the following heuristic ordering is 
used to select a single candidate proposition: 

1. The proposition is true. 

2. Each argument satisfies the type declared for its position and the proposition is 
consistent. 

3. Each argument is consistent with the type declared for its position and the proposition 
is consistent. 

Within each category, the predicate closest to the original inconsistent predicate (in terms 
of number of nodes between them) is chosen. 
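A minimal sketch of minimal ascension, assuming a toy ISA hierarchy and type constraints on the first argument only. The predicate names other than those in the text (`Located-in`), the type labels, and the alphabetical tie-break are hypothetical; categories 2 and 3 of the ordering are collapsed into a single "type-consistent" test for brevity.

```python
from typing import Dict, List, Optional, Set

# Hypothetical fragment of an ISA hierarchy: child predicate -> parent predicate.
PARENT: Dict[str, str] = {
    "Contained-in": "Located-in",
    "Dissolved-in": "Located-in",
    "Immersed-in": "Located-in",
}
# Hypothetical type required of each predicate's first argument.
FIRST_ARG_TYPE: Dict[str, str] = {
    "Located-in": "substance",
    "Contained-in": "liquid",
    "Dissolved-in": "solid",
    "Immersed-in": "solid",
}
OBJECT_TYPES: Dict[str, Set[str]] = {"salt1": {"solid", "substance"}, "fog1": {"gas"}}


def ancestors(pred: str) -> List[str]:
    """The predicate itself followed by its ISA ancestors, nearest first."""
    chain = [pred]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def type_ok(pred: str, obj: str) -> bool:
    return FIRST_ARG_TYPE[pred] in OBJECT_TYPES[obj]


def distance(a: str, b: str) -> int:
    """Number of ISA links on the path between two predicates."""
    up_a, up_b = ancestors(a), ancestors(b)
    common = next(p for p in up_a if p in up_b)
    return up_a.index(common) + up_b.index(common)


def minimal_ascension(pred, obj, other, known) -> Optional[str]:
    # Climb until some ancestor accepts the argument's type.
    top = next((p for p in ancestors(pred)[1:] if type_ok(p, obj)), None)
    if top is None:
        return None
    candidates = [p for p in FIRST_ARG_TYPE
                  if p not in (pred, top) and top in ancestors(p) and type_ok(p, obj)]
    if not candidates:
        return None
    # True propositions outrank merely type-consistent ones; then nearest wins.
    return min(candidates,
               key=lambda p: ((p, obj, other) not in known, distance(pred, p), p))


print(minimal_ascension("Contained-in", "salt1", "water",
                        {("Dissolved-in", "salt1", "water")}))
```

Sorting on a tuple keeps the two criteria separate: truth of the proposition is compared first, and hierarchy distance only breaks ties within a category.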

For example, if (Contained-in salt1 water) is present in a candidate inference about 
dissolving, the mapping component would be able to recognize that (Dissolved-in salt1 
water) is more appropriate. Why would the mapper use the base relation Contained-in 
if the relation Dissolved-in is known? Even if (Dissolved-in salt1 water) is true 
in the target instance, it may have been absent from the target description given to the 
mapper and thus prevented from matching. Alternatively, the mapping component could 
not have mapped Contained-in to Dissolved-in if the instantiated expression had never 
been formed before. 

5.3.2.3 Create predicate 

If a predicate cannot be mapped as is, and no suitable correspondent can be found, then 
a new predicate is built using the create-predicate operation. The ISA hierarchy is 
climbed until a predicate is found having argument type constraints consistent with their 
proposed instantiation. A new child is then created below this point (Burstein, 1985). The 
new predicate inherits all of the original's properties (e.g., commutative, n-ary, relational 
group), with each of the new predicate's argument types either remaining the same if 
consistent or changing to a known attribute of the argument occupying that position. 

In the case of Fluid-Path being applied to a heat-conducting metal bar, a new child of 
the more general Path predicate would be created (e.g., Path-18). 

^^The climb until consistent operation is taken from Burstein's (1985) method for creating new predicates. 
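The create-predicate step can be sketched as below. This is an illustrative reading of the description, not PHINEAS code: the `Predicate` record, the type labels, and the rule for picking a replacement argument type (alphabetically first known attribute) are assumptions introduced for the example.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Optional, Set, Tuple


@dataclass(frozen=True)
class Predicate:
    name: str
    parent: Optional[str]            # ISA parent, None at the root
    arg_types: Tuple[str, ...]       # declared type of each argument slot
    properties: FrozenSet[str] = frozenset()


def consistent(arg_types, args, object_types) -> bool:
    """True when every argument is known to have the type its slot declares."""
    return all(t in object_types[a] for t, a in zip(arg_types, args))


def create_predicate(original, args, preds, object_types, serial):
    node = preds[original]
    # Climb the ISA hierarchy until the slot types accept the proposed arguments.
    while not consistent(node.arg_types, args, object_types):
        if node.parent is None:
            raise ValueError("no type-consistent ancestor")
        node = preds[node.parent]
    source = preds[original]
    # Keep each original slot type if it fits; otherwise fall back to a known
    # attribute of the argument (alphabetically first: an arbitrary choice here).
    new_types = tuple(
        t if t in object_types[a] else sorted(object_types[a])[0]
        for t, a in zip(source.arg_types, args)
    )
    child = Predicate(f"{node.name}-{serial}", node.name, new_types, source.properties)
    preds[child.name] = child        # install the new predicate below `node`
    return child


PREDS: Dict[str, Predicate] = {
    "Path": Predicate("Path", None, ("physical-object",)),
    "Fluid-Path": Predicate("Fluid-Path", "Path", ("fluid-conduit",),
                            frozenset({"relational"})),
}
OBJECT_TYPES: Dict[str, Set[str]] = {"bar1": {"physical-object", "heat-conductor"}}

new_pred = create_predicate("Fluid-Path", ("bar1",), PREDS, OBJECT_TYPES, serial=18)
print(new_pred)
```

The new child hangs off the first type-consistent ancestor rather than the original predicate, which is why the example yields a child of Path and not of Fluid-Path.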

5.3.3 Unknown objects 

When a candidate inference contains slots occupied by unknown objects (skolem objects), 
suitable target objects must be found or their existence conjectured. There are four options: 

1. General physical knowledge. Search for a known item that may actually be the item 
in question. This is a component of the general abduction problem and is dealt with 
by the abductive retriever. 

2. Analogous conditions. Search for a known item that satisfies constraints considered 
functionally analogous to those conjectured for the skolem object. 

3. Directed experimentation. Experiments may be used to empirically determine what 
the missing entity is (e.g., Rajamoney et al., 1985). 

4. Hypothesize existence. The object's existence may simply be assumed and represented 
by a skolem constant. If it is a known type of object, then a standard assumption 
mechanism as used in abduction will suffice. Otherwise, the existence of some new, 
hypothetical entity may be assumed, as was done when ether was conjectured as a 
medium for light waves. 

The first and last options are described in this section. The second option was addressed 
above, under the heading of alternate expression retrieval. The third option is briefly 
reviewed later in this chapter and demonstrated in (Falkenhainer & Rajamoney, 1988). 

5.3.3.1 Abductive retrieve 

The abductive-retrieve operation seeks known objects that may consistently fill the role 
of a given skolem object. It begins by collecting all of the conditions the skolem must satisfy. 
For transfer of proposed QP theory processes, these conditioning relations are defined to be 
all propositions containing the skolem object and appearing as an antecedent in one of the 
proposed process definitions. These conditions are then passed to the abductive retriever 
as a single conjunctive goal. Since skolem objects are expressed as existentially quantified 
variables during this operation, returned instantiations represent entities assumably equal 
to the unknown skolem object. 

A branch bound (default 5) is used to limit the number of acceptable possibilities. 
If more candidates than this bound are retrieved, it is assumed that not enough information 
exists to make a decision and the skolem object remains unknown. 
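The role of the branch bound can be illustrated with a small sketch. It is a simplification under stated assumptions: conditions are matched only against known facts (the real operation also admits consistently assumable propositions), and the names `abductive_retrieve`, `SKOLEM`, and the toy fact base are hypothetical.

```python
from typing import List, Sequence, Set, Tuple

Fact = Tuple[str, ...]
SKOLEM = "?s"   # placeholder standing for the existentially quantified skolem


def abductive_retrieve(conditions: Sequence[Fact], objects: Sequence[str],
                       facts: Set[Fact], bound: int = 5) -> List[str]:
    """Known objects satisfying the whole conjunction, or [] if over the bound."""
    candidates = [
        obj for obj in objects
        # The conditions form one conjunctive goal: every one must hold for obj.
        if all(tuple(obj if term == SKOLEM else term for term in cond) in facts
               for cond in conditions)
    ]
    # Too many fillers means too little information: leave the skolem unknown.
    return candidates if len(candidates) <= bound else []


FACTS: Set[Fact] = {
    ("Solution", "water1"),
    ("Immersed-in", "salt1", "water1"),
    ("Solution", "brine2"),
}
CONDITIONS = [("Solution", SKOLEM), ("Immersed-in", "salt1", SKOLEM)]
OBJECTS = ["water1", "brine2", "beaker1"]

print(abductive_retrieve(CONDITIONS, OBJECTS, FACTS))
```

Here `brine2` is rejected because it meets only one of the two conditions; testing the conjunction as a whole is what prevents partial matches from being proposed.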

abductive-retrieve is not applied to skolem tokens representing compound objects. 
These are objects fully defined by their constituents and thus are known when all of their 
constituents are known. For example, a specific contained liquid (e.g., (Contained-Liquid 
cl1)) is uniquely identified by a substance (e.g., (Substance-of cl1 water)), in a liquid 
state (e.g., (Liquid cl1)), in a particular container (e.g., (Container-of cl1 beaker1)). 
If the cl1 token were not known for this contained liquid, a new token would simply be 
created. 

5.3.3.2 Create entity 

If a skolem object cannot be resolved, a new entity token may be created to fill the role. 
New entities are created using create-entity, which makes a new entity token and then 
analyzes the consistency of its proposed existence. 

Consistency for created entities deviates from the standard consistency operation. First, 
the new token is substituted into the skolem object's N conditions and the status of 

(Consistent $C_1 \wedge \cdots \wedge C_N$) 

is determined as described above.^ However, if the conjunction $C_1 \wedge \cdots \wedge C_N$ is 
inconsistent using the new entity token, then 

$(\forall x)\,\neg[C_1(\ldots x \ldots) \wedge \cdots \wedge C_N(\ldots x \ldots)]$ 

is true, where all instances of the new entity are replaced by x. In other words, no such 
object could possibly exist under current beliefs. In this case, we are faced with the problem 
of conjecturing a new kind of entity as opposed to simply creating a new instance of a 
plausible entity type. The specific object conditions participating in the contradiction are 
extracted by examining the data dependency network.^^ One of these propositions is then 
modified using create-predicate to resolve the contradiction. A heuristic preferential 
ordering is used to determine which proposition is selected for replacement: (1) a unary 
proposition having the entity as its only argument or (2) a proposition having the entity 
as its only skolem object. The third "choose one at random" option is currently grounds 
to fail the analogy, primarily because its repercussions have not been fully considered. 

^Note that the consistency of the $C_i$ must be determined together rather than individually, since each 
atomic sentence may be consistent when considered individually, but may not be consistent when considered 
in conjunction with the other N - 1 assumptions. (Charniak, 1988) makes this point as well. 

^^In ATMS specifics, the minimal set of contradictory assumptions taken from the union of assuming each 
entity condition $C_i$. 

Conjectured entities have a long history in science and are often at the center of contro- 
versy in a developing theory. For example, an all-pervading ether was long postulated to 
provide a medium for the flow of light waves because other kinds of waves require a medium. 
Consider explaining the temperature changes occurring when a hot brick is immersed in 
cold water, by analogy to liquid flow. If no knowledge of heat exists, an analogue to the 
flowing liquid is proposed and given the following conditions: 

(Substance ?liquid) ∧ (Liquid ?liquid) ∧ (Contained-in ?liquid brick1) 

Note that no object could ever satisfy this conjunction, given that brick1 is a non-porous 
object (i.e., (not (Porous brick1))). create-entity detects the contradiction 

(Liquid ?liquid) ∧ (Contained-in ?liquid brick1) ∧ (not (Porous brick1)) ⇒ ⊥ 

and conjectures a new entity having the conditions 

(Substance sk-water-6) ∧ (Sk-phase-1 sk-water-6) 
∧ (Contained-in sk-water-6 brick1) 

Sk-phase-1 is a new child predicate to Phase, which was the immediate parent to Liquid 
in the predicate hierarchy. 
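The brick example can be traced with a small sketch. It is only an approximation of create-entity under stated assumptions: the single hard-coded constraint in `conflict` stands in for the data dependency network, only the first replacement preference (a unary proposition on the entity) is implemented, and the helper names and the serial number are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Prop = Tuple[str, ...]
NONPOROUS: Set[str] = {"brick1"}              # stands in for (not (Porous brick1))
PARENT: Dict[str, str] = {"Liquid": "Phase"}  # fragment of the predicate hierarchy


def conflict(conds: List[Prop]) -> List[Prop]:
    """Conditions jointly violating: a liquid cannot be inside a non-porous object."""
    liquids = {c[1] for c in conds if c[0] == "Liquid"}
    clash = [c for c in conds
             if c[0] == "Contained-in" and c[1] in liquids and c[2] in NONPOROUS]
    if not clash:
        return []
    culprits = [c for c in conds
                if c[0] == "Liquid" and any(c[1] == k[1] for k in clash)]
    return culprits + clash


def create_entity(skolem: str, conds: List[Prop], token: str, serial: int) -> List[Prop]:
    # Substitute the new entity token for the skolem in every condition.
    conds = [tuple(token if t == skolem else t for t in c) for c in conds]
    culprits = conflict(conds)
    if culprits:
        # Preference (1): a unary proposition whose only argument is the entity.
        unary = [c for c in culprits if len(c) == 2 and c[1] == token]
        if not unary:
            raise ValueError("analogy fails: no preferred proposition to replace")
        old = unary[0]
        # New child predicate created under the old predicate's parent.
        new_pred = f"Sk-{PARENT[old[0]].lower()}-{serial}"
        conds = [(new_pred, token) if c == old else c for c in conds]
    return conds


CONDITIONS: List[Prop] = [("Substance", "?liquid"), ("Liquid", "?liquid"),
                          ("Contained-in", "?liquid", "brick1")]
print(create_entity("?liquid", CONDITIONS, "sk-water-6", serial=1))
```

Replacing only the offending unary condition leaves the rest of the conjecture intact, so the new entity is still a substance contained in the brick but no longer claimed to be a liquid.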

5.3.4 The map and analyze cycle 

The transfer task is to create a set of usable target hypotheses from candidate inferences 
proposed by mapping, or reject the analogy if this cannot be accomplished. A decompo- 
sition of this process appears in Figure 5.4. Transfer consists of two phases. The probe 
phase attempts to repair inadequacies of the candidate inferences by looking for alternatives 
within the set of beliefs derivable or assumable from existing knowledge. If everything is 
adequate (all expressions are consistent and there are no skolem objects) or can be repaired 
with current knowledge, then the set of candidate inferences is returned as operational 
target hypotheses. The resolve phase repairs disclosed inadequacies when the probe phase 
fails, by going beyond existing knowledge. This may involve hypothesizing the existence of 
an unknown object or creating a new predicate. 

Throughout the discussion, I will refer to the mapping and the inferences. However, the 
transfer process has the potential to branch (e.g., multiple alternative fillers for a skolem 
object), producing a set of (mapping, inferences) pairs. The discussion has been simplified 
by considering the operation of a single branch. 
Figure 5.4: The transfer process 



Transfer uses the set of operations described in the preceding sections and summarized 
in Table 5.1. The algorithm is shown in Table 5.2. The process starts when a new mapping 
is received and the probe phase is entered. 

5.3.4.1 Probe phase 

The probe phase begins by examining each proposed fact in the set of candidate inferences. 
Because of the hierarchical representations being used in conjunction with SME, I should 
Table 5.1: Summary of transfer operations. 

• show-consistent(expression) → {success | failure} 

• retrieve-alternate-expression(P ...) → (P' ...) 

• create-predicate(P ...) → (P' ...) and P' installed 

• abductive-retrieve(expression) → {(expression θ_1, A_1) ... (expression θ_N, A_N)} | failure 

• create-entity(skolem-object, conditions) → entity-token [conditions'] 
Table 5.2: Transfer algorithm. 

1. Probe phase 

   Repeat until no new information can be retrieved or there are no inconsistent expressions 
   and no skolem objects: 

   (a) Probe expressions 

       For every expression E ∈ CI 

         i.  show-consistent(E) 

         ii. If not consistent, retrieve-alternate-expression(E) 
             (accept provably true alternates only) 

   (b) Probe skolems 

       For every skolem object S ∈ skolem-objects(CI) 
       Let C_S = {E_S | E_S ∈ CI and S ∈ E_S} 

         i.  abductive-retrieve(∧(C_S)) 

         ii. If retrieval fails, for every E_S ∈ C_S 
             retrieve-alternate-expression(E_S) 

   (c) If (there exists an expression E ∈ CI that is not consistent 
       or a skolem object that cannot be identified) 
       and new information has been found by retrieve-alternate-expression 
       then invoke mapping with base and target descriptions augmented with the new 
       information. 

2. Resolve phase 

   If inconsistent expressions or skolem objects still exist 

   (a) Resolve expressions 

       For every expression E ∈ CI that is not consistent 

         i.  retrieve-alternate-expression(E) 
             (accept any alternate in preferential order (section 5.3.2.2)) 

         ii. If retrieval fails, create-predicate(E) 

   (b) Resolve skolems 

       For every skolem object S ∈ skolem-objects(CI) 
       Let C_S = {E_S | E_S ∈ CI and S ∈ E_S} 
       create-entity(S, C_S) 

3. Return set of operational target inferences {I_T1 ... I_Tn}, 
   where each I_Ti represents a different permutation of the set of proposed modifications to 
   CI. 
clarify that what is meant by "proposed fact" depends upon the representation specifics, 
but may be identified as the smallest boolean expressions present in the representation. In 
terms of the predicate calculus, these correspond to all atomic sentences in the candidate 
inferences. Thus, in each QP process description proposed 

(Process-definition name instance-name 
  (Implies (And (Individual name (Conditions conditioning-facts)) 
                ... 
                preconditions and quantity conditions 
                ...) 
           (And effects))) 

the expressions to be analyzed comprise the conditions placed on each individual, 
the process' preconditions and quantity conditions, and the process' consequent effects. 

During the probe phase, each atomic expression is checked for consistency using 
show-consistent. If an expression is inconsistent, an alternative expression is sought, 
using retrieve-alternate-expression. It first seeks functionally analogous expressions 
that are currently believed. If none are found, alternate expressions are sought via minimal 
ascension, and only those currently believed are accepted (recall that minimal ascension 
may return alternates that are merely consistent with what is known). Any alternative, 
believed expressions found are collected and later used to augment the current base and 
target descriptions. 

The probe phase next examines each skolem object present in the candidate inferences. 
abductive-retrieve is used to seek known objects that may be consistently substituted 
for the unknown object. If retrieval fails, then retrieve-alternate-expression is applied 
to each of the skolem's proposed conditions. This addresses an important component of 
the correspondence problem: how can a target correspondent for a base object be found 
given only the base object's stated conditions?^^ Unless these conditions have been fully 
mapped, there may be no target correspondent to satisfy them. This is especially true 
of cross-domain analogies, in which an analogous pair of objects may have no identical 
characteristics in common. 

If all expressions are consistent and there are no unknown skolem objects remaining, 
transfer is successfully completed. The potentially modified candidate inferences are now 
considered operational for the target domain (recall there may be branching at this point, 
producing a set of alternate target hypotheses). When this is not accomplished, there are 
two options. If new information was retrieved, transfer returns to the mapping stage, using 
base and target descriptions augmented with the additional information.^^ This may lead 
to an augmented or alternate mapping and possibly new points of discrepancy on which to 
focus the transfer process. Otherwise, transfer proceeds to the resolve phase. 

^^Previous analogy systems have failed to address this problem. They either create a new token and 
assume the proposed conditions for it (Winston, 1982), work on constrained within-domain analogies that 
eliminate the problem (Carbonell, 1983a; Kedar-Cabelli, 1988), or don't examine the possibility of skolem 
objects produced by mapping (Burstein, 1983; Greiner, 1988). 

5.3.4.2 Resolve phase 

Intuitively, the resolve phase corresponds to abandoning attempts at using existing 
knowledge to resolve candidate inference inadequacies. It begins by reconsidering any remaining 
inconsistent expressions. retrieve-alternate-expression is used to seek a consistent 
alternate predicate. Only the minimal ascension component applies at this point, since 
functionally analogous alternatives would have been found during probing. Unlike the 
probe phase, the resolve phase accepts any alternative predicate suggested, using the three 
preferential orderings described in section 5.3.2.2. If there are no consistent alternate 
predicates found, create-predicate is used to create a new expression. 

The resolve phase completes with all remaining skolem objects replaced by new 
object tokens, using create-entity. Transfer ends with the resulting set of usable target 
inferences. 
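The control flow of the two phases can be summarized in a short sketch. It is a deliberately reduced illustration, not the PHINEAS algorithm: skolem handling and branching are omitted, expressions are plain strings, and every operation is passed in as a hypothetical callable. The `max_remaps=1` default reflects the single repetition of mapping mentioned for the current implementation.

```python
from typing import Callable, List, Optional

Expr = str


def transfer(exprs: List[Expr],
             consistent: Callable[[Expr], bool],
             find_true_alt: Callable[[Expr], Optional[Expr]],
             find_any_alt: Callable[[Expr], Optional[Expr]],
             create_predicate: Callable[[Expr], Expr],
             remap: Callable[[List[Expr], List[Expr]], List[Expr]],
             max_remaps: int = 1) -> List[Expr]:
    """Probe with existing knowledge first; invent only when probing stalls."""
    remaps = 0
    while True:
        # Probe phase: collect believed alternates for inconsistent expressions.
        new_info = [alt for e in exprs if not consistent(e)
                    for alt in [find_true_alt(e)] if alt is not None]
        if all(consistent(e) for e in exprs):
            return exprs                      # already operational
        if new_info and remaps < max_remaps:
            exprs = remap(exprs, new_info)    # repeat mapping with augmented input
            remaps += 1
            continue
        break
    # Resolve phase: accept any consistent alternate, else create a new predicate.
    return [e if consistent(e) else (find_any_alt(e) or create_predicate(e))
            for e in exprs]


# Toy run loosely following the disappearing-alcohol example.
OK = {"Liquid alcohol1", "SK-Soluble alcohol1"}
result = transfer(
    ["Solid alcohol1", "Soluble alcohol1"],
    consistent=lambda e: e in OK,
    find_true_alt={"Solid alcohol1": "Liquid alcohol1"}.get,
    find_any_alt=lambda e: None,
    create_predicate=lambda e: "SK-" + e,
    remap=lambda exprs, info: ["Liquid alcohol1", "Soluble alcohol1"],
)
print(result)
```

Passing the operations in as callables keeps the sketch focused on ordering: believed alternates trigger remapping, and new vocabulary is created only after probing yields nothing further.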

5.3.5 Making necessary assumptions 

A "candidate inference" may be any number of things. Importantly, it need not be a 
simple answer. It will often represent an entire line of reasoning, either in the form of a 
compiled schema or model, an entire proof trace, or a complete planning or design sequence. 
For these to be useful to the performance element, the assumptions upon which they rest 
(e.g., preconditions or premises) must explicitly be assumed as true and the analogical 
relationship stated as the basis for the assumption. This is a phase of operation not seen 
in other analogy systems.^^ Yet, it is a vital component of any complex reasoning task. 

In PHINEAS, each candidate theory T about the current observation is associated with 
two nodes, (Believe T) and (Holds T) (see Figure 5.5). This results in dependencies 
emanating from T in opposite directions. (Believe T) is supported by those assumptions 
upon which T depends, which are all the assumptions made in the course of deriving T (e.g., 
the behavioral analogy, additional modifications made during transfer, or prior theories of 
which T is a revision). (Holds T), on the other hand, is an assumption upon which all 
components and consequences of T depend. 

^^In the current implementation, only one repetition of mapping is supported. This is due to time 
constraints on program development, rather than a serious technical problem. It has been sufficient for all 
examples examined to date. 

^^(Burstein, 1985) may be considered an exception, as a record of past mappings is maintained. 

[Figure 5.5 diagram: lattice whose nodes have the forms (Believe T), (Holds T), (Revision ...), 
(Emap ...), (B-Map ...), and (Conflict T H).] 

Figure 5.5: Prototypical justification lattice used in PHINEAS to support analogical reasoning. 
T_i represents a proposed theory about an observed behavior history H_j. A theory may 
be composed of process definitions P_i and entity definitions E_i. 

5.3.6 Example: Disappearing Alcohol 

Figure 5.6 repeats the candidate inference proposed by mapping dissolving into the 
disappearing alcohol situation. As previously noted, there are a number of problems with the 
inference as it stands. Entering the probe phase of transfer, the first task is to examine the 
consistency of each proposed expression. The following are found to be inconsistent: 

(Solid alcohol1) Alcohol is known to be a liquid, which contradicts the taxonomy that 
something can only be one of {solid, liquid, gas} at a time. 

(Soluble alcohol1) The argument to Soluble must be a solid. 

(Immersed-in alcohol1 (:skolem water1)) The first argument to Immersed-in must 
be a solid. 

(Soluble-in alcohol1 (:skolem water1)) The first argument to Soluble-in must be 
a solid. 

Applying retrieve-alternate-expression on each of these produces the following 
augments to the existing target description: 
(B-EXPLAINS 
  (SET (PACKET-DEFINITION (SOLUTION (:SKOLEM WATER1)) 
         (SET (QUANTITY (CONCENTRATION (:SKOLEM WATER1))) 
              (QUANTITY (SATURATION-POINT (:SKOLEM WATER1))) 
              (NOT (LESS-THAN (A (CONCENTRATION (:SKOLEM WATER1))) ZERO)) 
              (NOT (LESS-THAN (A (SATURATION-POINT (:SKOLEM WATER1))) ZERO)))) 
       (PROCESS-DEFINITION (:SKOLEM DISSOLVE) (:SKOLEM PI1) 
         (IMPLIES 
           (AND (INDIVIDUAL ALCOHOL1 
                  (CONDITIONS (SOLID ALCOHOL1) (SOLUBLE ALCOHOL1))) 
                (INDIVIDUAL (:SKOLEM WATER1) 
                  (CONDITIONS (SOLUTION (:SKOLEM WATER1)) 
                              (IMMERSED-IN ALCOHOL1 (:SKOLEM WATER1)))) 
                (SOLUBLE-IN ALCOHOL1 (:SKOLEM WATER1)) 
                (GREATER-THAN (A (AMOUNT-OF ALCOHOL1)) ZERO)) 
           (AND (QUANTITY (DISSOLVE-RATE (:SKOLEM PI1))) 
                (QPROP (DISSOLVE-RATE (:SKOLEM PI1)) (SURFACE-AREA ALCOHOL1)) 
                (GREATER-THAN (A (DISSOLVE-RATE (:SKOLEM PI1))) ZERO) 
                (CTRANS (AMOUNT-OF ALCOHOL1) (CONCENTRATION (:SKOLEM WATER1)) 
                        (A (DISSOLVE-RATE (:SKOLEM PI1)))))))) 
  OPEN-ALCOHOL-BEHAVIOR) 

Figure 5.6: Candidate inference proposed by mapping dissolving into the disappearing 
alcohol situation. 



(Liquid alcohol1) 

(Supports (Touching alcohol1 atmosphere) 
          (Physical-Path alcohol1 atmosphere (common-face alcohol1 atmosphere))) 

(Supports (Open beaker2) 
          (Physical-Path alcohol1 atmosphere (common-face alcohol1 atmosphere))) 

The first expression is suggested as an alternative for (Solid alcohol1). The other 
two are alternatives for (Immersed-in alcohol1 (:skolem water1)). All three of these 
facts were absent from the original target description given SME, but are derivable from 
it. For example, (Touching alcohol1 atmosphere) follows from a contained liquid in a 
container that is open. (Open beaker2) is the only expression appearing in the original 
target description, but its relevant function was not present. 

The latter two target augments lead to the following base augment 

(Supports (Immersed-In salt1 water1) 
          (Physical-Path salt1 water1 (common-face salt1 water1))) 
The second task of the probe phase is to seek known objects that may be substituted 
for the skolem objects present in the candidate inference. In this case there is only one, 
(:skolem water1), which indicates there was no target correspondent for water1, the liquid 
into which salt1 was dissolving. abductive-retrieve is invoked on the conjunction^^ 

(Soluble-In alcohol1 ?water1) ∧ (Solution ?water1) 
∧ (Immersed-In alcohol1 ?water1) 

which fails to find any instantiations for which the conjunction may be assumed. 

At this point, the probe phase is completed. It failed to operationalize the candidate 
inference, but successfully retrieved additional relevant information about the target. The 
target and base are augmented with the expressions shown above and mapping is repeated, 
which produces 

(B-Explains 
  (Set (Packet-Definition (Solution atmosphere) 
         (Set (Quantity (Concentration atmosphere)) 
              (Quantity (Saturation-Point atmosphere)) 
              (Not (Less-than (A (Concentration atmosphere)) zero)) 
              (Not (Less-than (A (Saturation-Point atmosphere)) zero)))) 
       (Process-Definition (:skolem dissolve) (:skolem pi1) 
         (Implies 
           (And (Individual alcohol1 
                  (Conditions (Liquid alcohol1) (Soluble alcohol1))) 
                (Individual atmosphere 
                  (Conditions (Solution atmosphere) 
                              (Touching alcohol1 atmosphere))) 
                (Soluble-in alcohol1 atmosphere) 
                (Greater-than (A (amount-of alcohol1)) zero)) 
           (And (Quantity (dissolve-rate (:skolem pi1))) 
                (Qprop (dissolve-rate (:skolem pi1)) (surface-area alcohol1)) 
                (Greater-than (A (dissolve-rate (:skolem pi1))) zero) 
                (I- (amount-of alcohol1) (A (dissolve-rate (:skolem pi1)))) 
                (I+ (concentration atmosphere) (A (dissolve-rate (:skolem pi1)))))))) 
  open-alcohol-behavior) 

This second mapping is much more complete. Importantly, the object atmosphere 
has been introduced as a consideration in the target vocabulary, leading to the complete 
absence of skolem objects.^^ 

^^A simple efficiency improvement on the current implementation should become obvious at this point. 
PHINEAS doesn't know that it already determined that (Immersed-in alcohol1 ?water1) is inconsistent 
in isolation. 

^^Recall that the skolems (:skolem dissolve) and (:skolem pi1) are treated specially by PHINEAS. 
Repeating the probe phase of transfer, the following expressions are now found to be 
inconsistent: 

(Solution atmosphere) The argument to Solution must be a liquid, while atmosphere 
is a gas. 

(Soluble alcohol1) The argument to Soluble must be a solid. This was discovered 
during the previous probe session as well. 

(Soluble-in alcohol1 atmosphere) The first argument to Soluble-in must be a solid. 
This was discovered during the previous probe session as well. 

When retrieve-alternate-expression is applied to each of these expressions, no 
alternatives are found. In the case of (Solution atmosphere), this indicates PHINEAS' lack 
of domain theory about general mixtures. 

At this point, the probe phase is completed. However, unlike the first probe phase, it has 
failed to operationalize the candidate inference and failed to retrieve additional information 
about the target. As a result, the resolve phase begins. There are no skolem objects to 
resolve, so the only task is to create new expressions for the three contradictory ones shown 
above. The following translations are made: 

(Solution-8 atmosphere) 
(SK-Soluble-4-1 alcohol1) 
(SK-Soluble-in-4-1 alcohol1 atmosphere) 

These postulate that the atmosphere is analogous to a liquid solution and that alcohol1 is 
"soluble in" the atmosphere similar to the way salt is soluble in water. The final transferred 
hypothesis, translated to pure QP theory syntax, is shown in Figure 5.7. While usable, there 
is no guarantee it will fully and consistently explain the observed behavior. Verifying that 
is the topic of the next chapter. 

5.3.7 Interactive, empirical transfer 

In complicated analogies, the transfer process will motivate a number of questions that 
are very difficult to answer from a passive, logical analysis. Is condition C_i in the theory 
necessary in this case to bring about the observation? Is this object not mentioned in 
the original observation description present? (Falkenhainer & Rajamoney, 1988) presents 
a protocol for just this type of empirical question answering. In that work, PHINEAS was 
merged with Rajamoney's (1988a) experimentation system (ADEPT). This provided PHINEAS 
with a means to obtain empirical answers to the various questions that arise during the 
(DEFPROCESS (PROCESS-4 ?V-10 ?V-11) 
  INDIVIDUALS ((?V-10 :CONDITIONS (LIQUID ?V-10) (SK-SOLUBLE-4-1 ?V-10)) 
               (?V-11 :CONDITIONS (SOLUTION-8 ?V-11) (TOUCHING ?V-10 ?V-11))) 
  PRECONDITIONS ((SK-SOLUBLE-IN-4-1 ?V-10 ?V-11)) 
  QUANTITYCONDITIONS ((GREATER-THAN (A (AMOUNT-OF ?V-10)) ZERO)) 
  RELATIONS ((QUANTITY (DISSOLVE-RATE ?SELF)) 
             (QPROP (DISSOLVE-RATE ?SELF) (SURFACE-AREA ?V-10)) 
             (GREATER-THAN (A (DISSOLVE-RATE ?SELF)) ZERO)) 
  INFLUENCES ((CTRANS (AMOUNT-OF ?V-10) (CONCENTRATION ?V-11) 
                      (A (DISSOLVE-RATE ?SELF))))) 

(DEFENTITY (SOLUTION-8 ?V-12) 
  (QUANTITY (CONCENTRATION ?V-12)) 
  (QUANTITY (SATURATION-POINT ?V-12)) 
  (NOT (LESS-THAN (A (CONCENTRATION ?V-12)) ZERO)) 
  (NOT (LESS-THAN (A (SATURATION-POINT ?V-12)) ZERO))) 

(ASSUME (SOLUTION-8 ATMOSPHERE)) 

(ASSUME (SK-SOLUBLE-4-1 ALCOHOL1)) 

(ASSUME (SK-SOLUBLE-IN-4-1 ALCOHOL1 ATMOSPHERE)) 

Figure 5.7: Transferred explanation for the disappearing alcohol observation by analogy to 
dissolving. 
normal course of analogical theory development. It provided ADEPT with a means of obtaining 
a restricted number of ordered hypotheses and focused, theoretically motivated queries 
about the world. 

For example, consider how the disappearing alcohol observation may be explained by 
analogy to boiling. When PHINEAS is operating in isolation, it must assume the presence 
of an ideal heat source (e.g., stove) and alcohol gas. When coupled with ADEPT, PHINEAS 
is able to ask 

(Present? ?steam1 ((Contained-Gas ?steam1) 
                   (Container-of ?steam1 beaker2) 
                   (Substance-of ?steam1 alcohol1))) 

which ADEPT answers positively by asking that litmus paper be placed in contact with the 
air in beaker2 and noting its change in color. Additionally, PHINEAS is now able to ask 
if the heat flow process is necessary for the observation to occur. When ADEPT repeats 
the observation using a thermally isolated container (through instructions to a human 
assistant), the alcohol continues to disappear. This leads to a new evaporation process by 
analogy to boiling which does not require heat flow, something PHINEAS cannot propose 
when operating in isolation. 
Chapter 6 

Verification-Based Analogical 
Learning 

The preceding chapters described methods for generating explanatory hypotheses by anal- 
ogy. However, a good analogizer doesn't simply stop with the statement of an analogy. 
How these hypotheses are evaluated and ultimately used is just as important as how they 
are generated. Are the proposed inferences correct, likely to be correct, or consistent? Do 
they provide a solution for the task at hand? The analogy must be evaluated for consis- 
tency and coverage. Interaction with other beliefs must be checked. Indeed, the hypothesis 
must be tested to be sure it predicts the very observation it was intended to explain. 

Nevertheless, the problem of evaluating an analogy's validity and using it in a complex 
reasoning task has received little attention. Most models avoid the issue by either 
stopping once inferences are produced (Holyoak & Thagard, 1988b; Kass et al., 1986; 
Indurkhya, 1987) or requiring that analogy produce d-sound inferences in which the results 
could have been achieved (more slowly) without the analogy (Kling, 1971; Carbonell, 1983a; 
Kedar-Cabelli, 1985b). The validity problem is central to robust analogical processing. In 
particular, if i-sound inferences are to be allowed, they must undergo a series of evaluative 
processes. This is reflected in the following requirement: 

• Verification requirement: When possible, the results of analogy must be tested 
empirically and against other knowledge. 

In analogical reasoning, the verification requirement is automatically satisfied as a natural 
side-effect of achieving the goals of the problem solver (i.e., backtracking may occur at 
points of inaccuracies). The verification requirement establishes a similar set of goals for an 
analogical learning system - the goal of improving the believed accuracy of its knowledge. 
Verification-based analogical learning (VBAL) is designed specifically to address this 
requirement. It depicts analogical learning as an iterative process of hypothesis formation, 
verification, and revision, centered around the requirement to confirm accuracy and 
increase the likelihood of being correct. VBAL relies on analogical inference to propose 
explanations and gedanken experiments in the form of qualitative simulation to analyze 
their validity. Specifically, the predictions of a new model are compared against observed 
behavior, enabling the system to test the validity of the analogy and sanction refinements 
where the analogy is incorrect. 

VBAL may be illustrated with the following scenario. When two bodies, one hot and one 
cold, are placed in contact with each other, they will eventually reach the same temperature. 
If the notion of water flow suggests itself, we may construct a model for the situation in 
which a heat fluid is seen "flowing" from a higher temperature to a lower temperature. 
Using this new model shows that it accurately explains the phenomenon. This is called 
verifying the initial adequacy of the model. The new theory now predicts that certain other 
events must be possible for the given physical configuration, such as the bidirectionality 
of heat flow. We attempt to recollect a prior experience demonstrating this predicted 
behavior or we conduct simple experiments to explore the space of hypothesized behaviors 
for the given objects. This is called verifying the local predictions of the model. If we 
were to explore the consequences of the analogy beyond the current situation, a number 
of additional predictions may be made based upon the intrinsic properties of liquids and 
physical objects. For example, conservation of matter would lead to predictions based 
on conservation of heat. Exploring the consequences of these additional predictions is 
called verifying the extended predictions of the analogy. This cycle of hypothesis formation, 
confirmation, refutation, and subsequent refinement is the essence of verification-based 
analogical learning. Its focal point is a view of analogical inference as a means to propose 
an initial model of a domain that may need adjustment. 

The consistency of a proposed model may be verified in two ways. First, the model may
be formally analyzed and proven consistent both internally and with respect to existing
knowledge. Second, the model can be used in some performance task and any resulting
inconsistencies sought. The first is preferable for obvious contradictions arising from simple
lookups of atomic sentences, as is done during transfer. However, for all but the most
trivial theories, determining absolute consistency is undecidable (Boolos & Jeffrey, 1974).
Hence, an approximate form of consistency verification is necessary.¹ VBAL is a proposal
for performance as a basis for analyzing the global consistency of a hypothesized model.
Unless obvious contradictions are detected, the system should test the model against a
sample of situations, including the current one.

¹Indurkhya (1987) and Greiner (1988) use finite complexity bounds to form an approximate definition
of consistency.

In terms of physical models, such simulation-oriented testing corresponds to the idea of
a gedanken² or thought experiment. A complete gedanken analysis consists of taking a
general theory, or set of premises, and imagining artificial scenarios to see what the theory
has to say about each scenario. When it conflicts with prior conceptions or when we reach
mutually incompatible conclusions, we have a setting for learning. Simulation is needed to
tease apart implicit inconsistencies of belief that are difficult, if not impossible, to detect
from an analysis of a model in isolation. Only when we try to use a model, with all of its
potential interaction with prior beliefs, do we expose inconsistencies.

This chapter describes the use of one particular type of gedanken experiment - qualitative
simulation. It begins by reviewing the processes of qualitative reasoning and measurement
interpretation, which generate the predictions of a given model and compare those
predictions to observation. The acceptance criteria used in model evaluation are then
described, followed by examples demonstrating the different roles of qualitative simulation as
an evaluative method. Finally, methods for examining the local and extended predictions
of an analogy through both simulation and empirical experimentation are proposed.

6.1 Qualitative Reasoning

One of the goals of qualitative reasoning research is to formalize the tacit rules people
use to mentally simulate the behavior of a system through time (Bobrow, 1985). When a
qualitative model has been constructed, an analysis of the model by the reasoner produces
an envisionment - a description of the possible behaviors for the current physical
configuration. The behavior of the system through time may then be represented as a single path
through the envisionment. The system is able to provide an explanation for the observations
and verify the model's consistency if a path through the envisionment formed from
the model can be found that corresponds to the measurements.
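
The idea of explaining an observation by finding a path through an envisionment can be sketched as a small graph search. The following Python sketch is illustrative only and is not code from PHINEAS or QPE: the state encoding (a dictionary from quantity name to derivative sign), the `find_path` helper, and the two-container example values are all assumptions made for this example.

```python
from typing import Dict, List, Optional

State = Dict[str, int]  # quantity name -> sign of its derivative (-1, 0, +1)

def matches(observed: State, state: State) -> bool:
    """An observed segment matches a state if every measured quantity agrees."""
    return all(state.get(q) == v for q, v in observed.items())

def find_path(states: Dict[str, State],
              transitions: Dict[str, List[str]],
              observation: List[State]) -> Optional[List[str]]:
    """Depth-first search for a state path that explains each segment in order."""
    def extend(path: List[str], idx: int) -> Optional[List[str]]:
        if idx == len(observation):
            return path
        # The first segment may start anywhere; later ones must follow a transition.
        candidates = transitions.get(path[-1], []) if path else list(states)
        for name in candidates:
            if matches(observation[idx], states[name]):
                found = extend(path + [name], idx + 1)
                if found is not None:
                    return found
        return None
    return extend([], 0)

# Hypothetical three-state envisionment for two connected containers.
STATES = {
    "flow-to-vial":   {"amount-beaker": -1, "amount-vial": +1},
    "flow-to-beaker": {"amount-beaker": +1, "amount-vial": -1},
    "equilibrium":    {"amount-beaker": 0,  "amount-vial": 0},
}
TRANSITIONS = {
    "flow-to-vial": ["equilibrium"],
    "flow-to-beaker": ["equilibrium"],
    "equilibrium": [],
}
OBSERVED = [{"amount-beaker": -1, "amount-vial": +1},
            {"amount-beaker": 0, "amount-vial": 0}]

print(find_path(STATES, TRANSITIONS, OBSERVED))  # ['flow-to-vial', 'equilibrium']
```

If no path exists, `find_path` returns `None`, which in the terms used here would mean the model fails to explain the observation.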

In terms of envisioning, a given model may produce two classes of predictions:

• Scenario Predictions: Those behaviors predicted from the model applied to the current
physical configuration (i.e., the observed set of objects and the structural relationships
between them). A total envisionment for the current situation will generate all possible
behaviors. An attainable envisionment will generate the subset reachable from some
specified starting state, which in this case would be the initial state of the observation.

• Extended Predictions: Those behaviors describable from the same model applied to
new configurations of objects.

²German word for thoughts. As far as I can determine, the phrase "gedanken experiment" was popularized
due to Einstein's masterful use of the technique, although the technique itself dates back to at least Galileo.

(a) Observed behavior (↓ decreasing, ↑ increasing, - constant)

    ↓ Amount-of(cs-beaker)          - Amount-of(cs-beaker)
    ↑ Amount-of(cs-vial)            - Amount-of(cs-vial)
    ↓ Pressure-In(beaker)           - Pressure-In(beaker)
    ↑ Pressure-In(vial)             - Pressure-In(vial)

(b) Process description and the scenario predictions it produces

    PROCESS Liquid-Flow
    Individuals
        ?subst, liquid
        ?src, Can-Contain(?src, ?subst)
        ?src-cs, Contained-Liquid
        ?dst, Can-Contain(?dst, ?subst)
        ?dst-cs, Contained-Liquid
        ?path, Fluid-Path(?src, ?dst, ?path)
    Preconditions Fluid-Aligned(?path)
    QuantityConditions Pressure(?src) > Pressure(?dst)
    Relations
        flow-rate = Pressure(?src) - Pressure(?dst)
        Ctrans(Amount-of(?src), Amount-of(?dst), flow-rate)

    ↓ Amount-of(cs-beaker)       ↑ Amount-of(cs-beaker)       - Amount-of(cs-beaker)
    ↑ Amount-of(cs-vial)         ↓ Amount-of(cs-vial)         - Amount-of(cs-vial)
    ↓ Pressure-In(beaker)        ↑ Pressure-In(beaker)        - Pressure-In(beaker)
    ↑ Pressure-In(vial)          ↓ Pressure-In(vial)          - Pressure-In(vial)
    > [Pressure-In(beaker),      < [Pressure-In(beaker),      = [Pressure-In(beaker),
       Pressure-In(vial)]           Pressure-In(vial)]           Pressure-In(vial)]

Figure 6.1: Qualitative simulation.

In these terms, adequacy verification succeeds when a path through the model's scenario
predictions can be found which corresponds to the observed behavior. Local prediction
verification consists of confirming the possibility of the alternate scenario predictions, either
exhaustively or heuristically (Section 6.4). Verifying the extended predictions of the analogy
involves exploration of the derived model's impact on the system's total beliefs through a
select set of extended predictions. Of course, extended verification is potentially unbounded
and implementing it would require a specification of what predictions to explore.

PHINEAS uses Forbus' (1986b) qualitative process engine (QPE) to produce an envisionment
for an analogically proposed model applied to the given physical configuration. For
example, Figure 6.1(a) shows a beaker connected to a vial and the observed liquid flow
behavior for this configuration. Figure 6.1(b) shows a liquid flow process description and the
scenario predictions it produces for the beaker-vial configuration. The darkened two-state
path indicates where the liquid flow model consistently explains the observed behavior.
Initially, the beaker pressure is greater than the vial pressure, and liquid is flowing from
the beaker to the vial. Eventually, a new state is reached in which their pressures are equal
and flow has stopped. However, the model makes additional scenario predictions. If the
vial had started out with a higher pressure than the beaker, liquid should flow the other
way. Consider how a unidirectional valve in the pipe would affect the validity of this
prediction. The scenario predictions of Figure 6.1(b) only envision behaviors for two containers
attached by a liquid path. Extended predictions might include the expected behavior of
three containers connected in series. The liquid flow model might be extended even further
to consider its consequences for situations like siphon or faucet flow.

6.1.1 Measurement Interpretation 

Central to analyzing the consistency of a model through qualitative simulation is the process
of comparing its predictions to observation and identifying points of discrepancy. This is the
measurement interpretation problem. Measurement interpretation is the process of finding
the best temporally preserving mapping between an observed behavior and a corresponding
set of envisioned states (Forbus, 1986a; DeCoste, 1989). This correspondence represents
an interpretation of the behavior by associating it with specific predictions of a model. It
may be distinguished from the general interpretation or explanation problem in two ways.
First, measurement interpretation will assume all potentially relevant theories were used
in producing the scenario envisionment. Second, measurement interpretation addresses the
problems of data analysis and real time processing. Mapping a continuous process to an
internal discretization can be a dynamic process, dependent upon where gaps, noise, faulty
sensors, etc. are believed to exist at any given point in the measurement sequence. For
PHINEAS, perfect data is assumed (i.e., no noise or faulty sensors). The task itself, and
resolution of the corresponding control issues, is performed by DeCoste's dynamic across-time
measurement interpretation system, DATMI (DeCoste, 1989). This includes selecting
a best match between model and observation, and deciding when to doubt the model and
allow a sequence to go uninterpreted.

Before further discussing the measurement interpretation task, we require the appro- 
priate vocabulary. This draws primarily from Forbus' (1987) logic of occurrence. 

A single, time-varying description of the observed behavior will be called a history H
(Hayes, 1979). Since there is no unique behavioral description of an observation
(Chapter 4), H represents the behavioral description in current use. A history is composed of
behavioral segments (bsegs) that are temporally extended and spatially bounded. Bsegs
divide a measurement sequence into maximal, contiguous intervals over which all measured
quantities are qualitatively constant (i.e., maintain the same qualitative value). Bsegs were
defined in Chapter 4. The function Bsegs maps from a history to the set of bsegs that
comprise it.³

An envisionment E represents all possible qualitative states a particular system may take
on and all legal transitions between them. The function States maps from an envisionment
to the set of states that comprise it. It will be assumed that E and H agree in perspective
and granularity.

Definition 6.1 (ConsistentWith) ConsistentWith(b, s) is true whenever bseg b describes
behavior of the system that is not inconsistent with state s.

The set of possible interpretations of an observed bseg corresponds to all envisionment
states that consistently explain it. This is all states s such that ConsistentWith(b, s) is
true. A single interpretation must then be selected based on global consistency derived by
considering temporally adjacent bsegs and states.

Definition 6.2 (OccursAt) Given s ∈ States(E), b ∈ Bsegs(H), OccursAt(s, b) is true
exactly when the state s represents what is happening during b.

Since "what is happening" can never be absolutely determined and there may be multiple
interpretations possible, OccursAt will be taken to mean s is the assumed interpretation
for bseg b. Since bsegs may partially specify the properties that distinguish individual
states, a path of qualitative states may occur within a given bseg. The converse of OccursAt
may be defined as well:

Definition 6.3 (NoInterpret) Given b ∈ Bsegs(H), NoInterpret(b) is true iff for all
s ∈ States(E), OccursAt(s, b) is false.

A registration is a mapping which relates an observation to an envisionment by pairing
bsegs in the observation with corresponding states in the envisionment. In other words, it
is the chosen interpretation for an observation and consists of all true OccursAt statements
for bsegs in the observation.

DATMI takes a history H and an envisionment E. It returns a registration, which is a
temporally ordered (over H) sequence of OccursAt(s_i, b_i) and NoInterpret(b_j) statements.

³This deviates from the definition used in (Forbus, 1987), which refers to the behavior and uses the
term "episode" to refer to behavioral segments. Here I assume a single fixed description of the behavior is
in use at a particular time and will use "bseg" to remain consistent with Chapter 4.

Observation                                 Envisionment

alcohol-going                               qstate-1
  Ds[Amount-of(alcohol1)]      -1             Ds[Amount-of(alcohol1)]      -1
  Ds[Amount-of(sk-steam1-1)]    1             Ds[Amount-of(sk-steam1-1)]    1
  Ds[Pressure-of(alcohol1)]    -1             Ds[Pressure-of(alcohol1)]    -1
  Ds[Pressure-of(sk-steam1-1)]  1             Ds[Pressure-of(sk-steam1-1)]  1
  A[Amount-of(alcohol1)]       >0             A[Amount-of(alcohol1)]       >0
  A[Amount-of(sk-steam1-1)]    >0             A[Amount-of(sk-steam1-1)]    >0
                                                          ↓
alcohol-stopped                             qstate-2
  Ds[Amount-of(alcohol1)]       0             Ds[Amount-of(alcohol1)]       0
  Ds[Amount-of(sk-steam1-1)]    0             Ds[Amount-of(sk-steam1-1)]    0
  Ds[Pressure-of(alcohol1)]     0             Ds[Pressure-of(alcohol1)]     0
  Ds[Pressure-of(sk-steam1-1)]  0             Ds[Pressure-of(sk-steam1-1)]  0
  A[Amount-of(alcohol1)]       >0             A[Amount-of(alcohol1)]       =0
  A[Amount-of(sk-steam1-1)]    >0             A[Amount-of(sk-steam1-1)]    >0

Figure 6.2: Forming a registration between observed bsegs and model-derived states.



For example, Figure 6.2 shows the behavior of a closed container filled with liquid and
the envisionment from a flawed evaporation model that indicates all contained liquids
evaporate. The registration returned by DATMI for this observation-model pair is:⁴

((OccursAt qstate-1 evaping) (NoInterpret evap-stopped))

This indicates that the model consistently explains the evaping bseg, which corresponds
to qstate-1 in the envisionment. However, the model is inconsistent with the behavior in
evap-stopped. The behavior shows no loss of liquid in a non-empty container, while the
model believes evaporation should continue until the container is empty.
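
The pairing of bsegs with states can be illustrated with a deliberately simplified sketch. This is not DATMI's algorithm: it ignores temporal adjacency and global consistency and simply takes the first consistent state for each bseg. The function names, the string encoding of qualitative values, and the reduced two-quantity data are assumptions made for the example.

```python
from typing import Dict, List, Tuple

Values = Dict[str, str]  # quantity -> qualitative value, e.g. "-1", "0", ">0"

def consistent_with(bseg: Values, state: Values) -> bool:
    """ConsistentWith(b, s): no measured value in b contradicts s.

    Quantities the state does not mention are treated as unconstrained.
    """
    return all(state.get(q, v) == v for q, v in bseg.items())

def register(history: List[Tuple[str, Values]],
             states: Dict[str, Values]) -> List[Tuple[str, ...]]:
    """Pair each bseg with the first consistent state, else mark it uninterpreted."""
    registration: List[Tuple[str, ...]] = []
    for name, bseg in history:
        match = next((s for s, vals in states.items()
                      if consistent_with(bseg, vals)), None)
        if match is None:
            registration.append(("NoInterpret", name))
        else:
            registration.append(("OccursAt", match, name))
    return registration

# Reduced version of the closed-container example: liquid stops decreasing
# while some remains, but the flawed model only stops when the liquid is gone.
HISTORY = [
    ("evaping",      {"Ds[Amount-of(alcohol1)]": "-1", "A[Amount-of(alcohol1)]": ">0"}),
    ("evap-stopped", {"Ds[Amount-of(alcohol1)]": "0",  "A[Amount-of(alcohol1)]": ">0"}),
]
STATES = {
    "qstate-1": {"Ds[Amount-of(alcohol1)]": "-1", "A[Amount-of(alcohol1)]": ">0"},
    "qstate-2": {"Ds[Amount-of(alcohol1)]": "0",  "A[Amount-of(alcohol1)]": "=0"},
}

print(register(HISTORY, STATES))
# [('OccursAt', 'qstate-1', 'evaping'), ('NoInterpret', 'evap-stopped')]
```

A fuller treatment would choose among several consistent states by checking that successive choices are linked by legal transitions in the envisionment.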

6.2 Determining the adequacy of a model 

The first step in evaluating a proposed model is to ensure that the model is at least
consistent with the observation it is intended to explain. This is called verifying the initial
adequacy of the model. Recall that mapping and transfer combine to produce an operational
model. The consequences of this model must then be explored. Does it make sense? Does
it conflict with current beliefs? Does it provide a complete and consistent explanation for
the original observation?

⁴This is modified for readability. DATMI actually returns the following list of internal structures:
(<Pinterp-1 q-1,seg-1> (:NO-INTERPRET 1.0 2.0)).

Once an operational model is produced by the transfer component, it is used to generate 
scenario predictions in order to: 

1. Verify initial adequacy. In the ideal case, the proposed hypothesis provides a complete 
and consistent explanation of the observed behavior. 

2. Detect inconsistencies. The model may be inconsistent, either internally or with
respect to prior beliefs.

3. Analyze coverage. The base and target behaviors may have only shared a few impor- 
tant properties, leading to extra predictions or unexplained behavior: 

(a) Identify explanatory inadequacies. The model may only explain a subset of the
phenomenon. What is and isn't explained must be identified before attempting
to augment the existing hypothesis through additional explanation processing.

(b) Reveal novel predictions. The model may predict additional behavior that was 
not reported in the original observation description. 

The registration is the basis for verification and revision. Specifically, it states the
conditions for acceptance under initial adequacy:

Definition 6.4 (Adequacy) A proposed model is considered adequate iff the registration
between the predictions E of the model and the observed behavior H is consistent and
complete. A registration is consistent if it assigns every bseg in H to at least one state in E. It
is complete if there are no OccursAt(s, b) in which bseg b describes a property not described
in state s.

Revision occurs where the registration fails, either where NoInterpret(b_j) is true or
where additional behavior was observed for which the model makes no prediction.

The method used in PHINEAS for determining initial adequacy is quite simple. This is
due primarily to the presence of QPE and DATMI, which are treated as primitive operations
by PHINEAS. Adequacy determination begins when an operational model is presented for
testing. The model and a description of the scenario under investigation are given to QPE,
which produces a total envisionment for the scenario. The envisionment and the observed
behavior description are then given to DATMI, which returns a registration between the
envisionment and the observation. If the registration is complete, the model is accepted as
initially adequate by PHINEAS. If there is more than one initially adequate hypothesis, a
preferential ordering is applied. Preferential orderings are handled by PHINEAS' task agenda,
which is described in Chapter 8. Finally, if the registration is not complete, revision will
be attempted on the model if no hypotheses are found to be adequate.
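
The adequacy test described above can be read as two checks over a registration. The sketch below is one possible reading, not the PHINEAS implementation: the tuple encoding of registration entries, the property sets, and all names are hypothetical, and the QPE and DATMI steps are assumed to have already produced the registration.

```python
from typing import Dict, List, Set, Tuple

Entry = Tuple[str, ...]  # ("OccursAt", state, bseg) or ("NoInterpret", bseg)

def is_consistent(registration: List[Entry]) -> bool:
    """Every bseg was assigned to some state (no NoInterpret entries)."""
    return all(entry[0] == "OccursAt" for entry in registration)

def is_complete(registration: List[Entry],
                bseg_props: Dict[str, Set[str]],
                state_props: Dict[str, Set[str]]) -> bool:
    """No paired bseg describes a property that its state leaves undescribed."""
    return all(bseg_props[entry[2]] <= state_props[entry[1]]
               for entry in registration if entry[0] == "OccursAt")

def is_adequate(registration: List[Entry],
                bseg_props: Dict[str, Set[str]],
                state_props: Dict[str, Set[str]]) -> bool:
    """Adequate iff the registration is both consistent and complete."""
    return (is_consistent(registration)
            and is_complete(registration, bseg_props, state_props))

# Hypothetical data: the second bseg reports a concentration the state omits.
REGISTRATION = [("OccursAt", "s1", "b1"), ("OccursAt", "s2", "b2")]
BSEG_PROPS = {"b1": {"level"}, "b2": {"level", "concentration"}}
STATE_PROPS = {"s1": {"level", "pressure"}, "s2": {"level"}}

print(is_adequate(REGISTRATION, BSEG_PROPS, STATE_PROPS))  # False
```

Keeping the two checks separate makes it possible to report which kind of failure occurred, which matters because each kind points revision at a different part of the model.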

6.3 Examples 

A simulation experiment may function in one of three general ways: as confirmation that
the model is complete and consistent, as a discrimination between alternate hypotheses, or
as an analysis of a model's coverage. This section presents three examples demonstrating
each of these roles.

6.3.1 As confirmation 

In the simplest case, a hypothesized model will fully predict the observed behavior. In
such cases, PHINEAS is able to verify the model's adequacy by finding a path through the
envisionment generated with the model that corresponds to the observation.

Consider the model explaining the disappearing alcohol observation that was developed
in the previous two chapters. Initially, PHINEAS was told that the amount of alcohol sitting
in an open beaker was decreasing, leaving the beaker eventually dry. From this observation,
it conjectured an alcohol "dissolving into the atmosphere" model through an analogy with
dissolving (shown in Figure 5.7). Thus, it may now attempt to interpret the original
situation. The proposed process model (Process-4) and associated assumptions are given
to QPE, which produces the two-state envisionment shown in Figure 6.3. PHINEAS finds that
the darkened path is consistent with the original observation and hence the new theory
accurately models the heat flow situation (at least in its overall qualitative behavior). It
has thus verified that the theory is consistent with this and functionally similar instances
of heat flow, enabling the new model to be added to the set of known domain theories.
Experiments may be used to further verify the theory and the new model, as discussed in
Section 6.4.

    Ds[Amount-of(alcohol1)] = -1
    Amount-of(alcohol1) > zero

    Ds[Amount-of(alcohol1)] = 0
    Amount-of(alcohol1) = zero

    Analysis of OPEN-ALCOHOL according to theory OPEN-ALCOHOL-BEHAVIOR-THEORY-8

    In behavioral segment 1
      (AMOUNT-OF ALCOHOL1) is Decreasing
      (CHANGE-RATE (AMOUNT-OF ALCOHOL1)) is Constant
      (A (AMOUNT-OF ALCOHOL1)) is Greater Than ZERO
    Due to the following processes being active:
      PROCESS-4(ALCOHOL1 ATMOSPHERE)
      which is analogous to DISSOLVE.

    In behavioral segment 2
      (AMOUNT-OF ALCOHOL1) is Constant
      (A (AMOUNT-OF ALCOHOL1)) is Equal To ZERO
    There are no processes active.

Figure 6.3: Demonstrating the adequacy of the proposed disappearing alcohol model
constructed by analogy to dissolving. The two-state envisionment at the top was produced by
QPE. The analysis at the bottom was produced by DATMI as a summary of its successful
interpretation.

6.3.2 As discrimination

Another role of simulation experiments is to discriminate between two or more mutually
incompatible, competing hypotheses. Only a subset of the hypotheses may be consistent
with the full constraints of the situation. When confronted with a set of candidate
hypotheses, the ideal is to be able to find only a single explanation that makes sense when fully
analyzed. In that case, the refuted hypotheses may be rejected outright (i.e., no revision
attempted) and the winning hypothesis retained as the one "true" explanation (assuming
only one, rather than all possible, is sought).

For example, consider the behavior of a beach ball suspended in an upward column of
flowing air, as in Figure 6.4. Stores will sometimes reverse the air flow of a vacuum cleaner
and suspend a beach ball in the exhaust jet as part of their display. The ball is very stable
and will remain within the jet even when slapped around. What holds the ball in place
and why is it so stable (Walker, 1975, problem 4.20)? PHINEAS is able to conjecture two
plausible explanations for what is happening. One is derived from the notion of pushing
and proposes that the air jet is pushing the ball upward from underneath and towards the
center on both sides (Figure 6.4(a)). The core of this model is the pair of relations:

Force(ball, position) > zero

Force(ball, position) ∝+ Amount-of(air, position)

Figure 6.4: Discriminating between two proposed explanations.

The other explanation is derived from PHINEAS' approximate knowledge of Bernoulli's effect
on airplane wings - that air flow can cause upward pull from above (this is a simplification
of the actual pressure drop leading to the pull effect). The core component of this model
is the pair of relations:

Force(ball, position) < zero

Force(ball, position) ∝+ Amount-of(air, position)
When these two hypotheses are "run", we find that only the second produces stability, as
evidenced by the oscillatory envisionment it produces. If the air were pushing on the sides
of the ball in proportion to how much air is on a given side, offsetting the ball to the right
would lead to further pushing by the air to the right - positive feedback and instability. On
the other hand, if the air were "pulling" on the sides due to pressure gradients, offsetting
the ball to the right would lead to increased force toward the left - negative feedback
and stability. Hence, the hypothesis drawn from abstract knowledge of Bernoulli's effect
consistently leads to the desired behavior and is proposed as the explanation. The pushing
hypothesis is rejected due to its inconsistency with the desired behavior.
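
The contrast between positive and negative feedback can be made concrete with a small numerical stand-in for the qualitative simulation. This is only an illustration under stated assumptions: a linear lateral force law, unit gain, no damping, and arbitrary step size and initial offset. None of these come from the models themselves, and the function name is hypothetical.

```python
from typing import Tuple

def simulate(gain: float, offset: float = 0.1,
             dt: float = 0.01, steps: int = 1000) -> Tuple[float, int]:
    """Semi-implicit Euler for x'' = gain * x; returns (max |x|, zero crossings)."""
    x, v = offset, 0.0
    peak, crossings = abs(x), 0
    for _ in range(steps):
        v += gain * x * dt          # lateral force is proportional to the offset
        new_x = x + v * dt
        if new_x * x < 0:           # the ball passed through the center line
            crossings += 1
        x = new_x
        peak = max(peak, abs(x))
    return peak, crossings

push_peak, push_crossings = simulate(gain=+1.0)  # force acts along the offset
pull_peak, pull_crossings = simulate(gain=-1.0)  # force acts against the offset

print(push_peak > 1.0, push_crossings)        # offset grows without bound, no return
print(pull_peak < 0.11, pull_crossings >= 1)  # offset stays bounded and oscillates
```

With the sign of the gain as the only difference, the "pushing" run diverges while the "pulling" run oscillates about the center, which mirrors the instability and stability distinguished in the text.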

6.3.3 As analysis of coverage 

If a set of explanations is produced and a non-zero subset is found consistent, then it
is probably reasonable to reject the failed hypotheses on the grounds that better ones are
known. However, if there is only one hypothesis, or every available hypothesis initially fails,
then outright rejection must be replaced by analysis and revision.

A proposed explanation may correctly predict portions of the observation, yet leave
other portions unexplained. Alternatively, it may correctly explain all of the observed
behavior, yet predict additional, unrecorded behavior. The primary role of simulating a
flawed model is to see exactly what behavior it does predict, so that points of discrepancy
may be found between model and observation.

Figure 6.5: Determining the coverage of a proposed explanation for osmosis.

Consider the situation in Figure 6.5, which depicts two containers, sharing a common
bottom and separated by a wall called "membrane". Each chamber contains a solution.
The level of solution S1 is observed to be decreasing while its concentration is increasing. At
the same time, the level of solution S2 is observed to be increasing while its concentration is
decreasing. In other words, osmosis is taking place. When PHINEAS is assigned to explain
the situation, it knows nothing about osmosis and focuses first on liquid flow due to the
overall behavioral similarity. An initial "liquid flow" model is proposed and used to envision
the possible behavior for the situation. This model corresponds to solution flowing through
the membrane from S1 to S2. It correctly predicts the change in fluid levels, as well as the
drop in S2's concentration (due to the inflow of S1's lower-concentration solution). However,
the model also states that S1's concentration should remain constant, in conflict with its
observed rise (see Figure 6.5). The model is therefore inadequate and must undergo revision,
hopefully to realize that solvent is flowing rather than the entire solution.
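
A coverage analysis of this kind amounts to sorting each quantity into what the model explains, what it contradicts, what it leaves unexplained, and what it predicts beyond the observation. The sketch below is a minimal illustration, assuming behaviors are encoded as derivative signs; the `coverage` function and the labels `S1` and `S2` for the two solutions are naming choices made for the example.

```python
from typing import Dict, Set, Tuple

Signs = Dict[str, int]  # quantity -> sign of its derivative (-1, 0, +1)

def coverage(observed: Signs,
             predicted: Signs) -> Tuple[Set[str], Set[str], Set[str], Set[str]]:
    """Split quantities into explained, conflicting, unexplained, and novel sets."""
    explained = {q for q, v in observed.items() if predicted.get(q) == v}
    conflicts = {q for q, v in observed.items()
                 if q in predicted and predicted[q] != v}
    unexplained = set(observed) - explained - conflicts
    novel = set(predicted) - set(observed)
    return explained, conflicts, unexplained, novel

# Observed behavior of the two solutions and the liquid flow model's predictions.
OBSERVED = {"level(S1)": -1, "level(S2)": +1,
            "concentration(S1)": +1, "concentration(S2)": -1}
LIQUID_FLOW = {"level(S1)": -1, "level(S2)": +1,
               "concentration(S1)": 0, "concentration(S2)": -1}

explained, conflicts, unexplained, novel = coverage(OBSERVED, LIQUID_FLOW)
print(sorted(conflicts))  # ['concentration(S1)']
```

The single conflicting quantity is the point of discrepancy at which revision would be directed.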



6.4 Further Verification 

Given that the new model is consistent with what has been observed, we may now seek
further empirical confirmation of its validity through various stages of experimentation.
A time-based planner has been used to explore the possibility of constructing simple
experiments to perform prediction verification. More complex experiments may have to be
performed to verify extensions of the analogy.

6.4.1 Verifying local predictions of the model

Given that the consistency of the model has been confirmed by finding a path in the
envisionment that explains the current situation, what may be said about the other paths
in this new envisionment? The new model states that the system should be able to exhibit
the other behaviors described by the envisionment.

A temporal planner, TPLAN (Hogge, 1987c), and its associated domain compiler (Hogge,
1987b, 1987a) have been used to explore constructing simple experiments for local
prediction verification.⁵ This planner is able to develop plans leading to situations in the
envisionment, allowing it to manipulate the scenario through various paths and confirm
or disconfirm the validity of the model's predictions. Only one example was successfully
completed before problems in planning technology, such as the need to plan for prevention
(Hogge, 1988), and other commitments delayed further investigation. However, this work
demonstrates several nice properties. First, the domain compiler enables dynamic
generation of new planning knowledge in response to learning new theories about the physical
world. Learning a model of heat flow enables learning new planning knowledge to raise the
temperature of objects. Second, because the envisionment explicitly describes all states
relevant to local prediction verification, creative experiment design of novel situations is
unnecessary.

Consider the scenario predictions generated for a novel heat flow situation in which a hot
brick is cooling off in hot water. Its model may further predict the bidirectionality of heat
flow, that is, the opposite behavior should occur if a cool brick is placed in hot water. First,
operators for heating and cooling are created from the system's new knowledge about heat
flow. Second, the goal is posted to achieve the "cool brick in hot water" situation identified
as one of the unconfirmed starting states of the scenario predictions. Starting from the
current equilibrium state, TPLAN generates a plan to (1) remove the brick from the water
(to prevent its heating up), (2) heat the water by placing it in contact with a hot stove, and
(3) return the brick to the now hotter water. When the plan is executed, behavior leading
from the initial achieved state may be observed and found to match prediction. Notice that
had the system been testing a new electrical flow through a diode model, rather than heat
flow, the experiment would have failed, uncovering a flaw in the model.

⁵This work was done in collaboration with John Hogge.

6.4.2 Verifying extended predictions of the analogy

If an analogy proves useful in understanding a given phenomenon, it would be wise to
extend the analogy further and explore the limits of the analogy's validity. For example,
the water flow - heat flow analogy may be extended by hypothesizing that heat is itself
a type of fluid and possesses the properties known to hold for fluids (the caloric theory
of heat). By extending the analogy in this manner, we are forced to conjecture a law of
conservation of heat which states that heat can never be lost nor created. In the early
nineteenth century, the caloric theory was widely believed and evidence for or against
conservation of heat was sought. It was Count Rumford's experiments with friction which
helped lead to the eventual downfall of the caloric theory of heat and supported the energy
interpretation. While the original flow model may remain essentially intact, its theoretical
underpinnings originating from extending the analogy to conjecture a heat fluid must be
replaced by a notion of energy flowing.

6.5 Psychological Relevance 

The qualitative simulation phase of testing used in PHINEAS is drawn from people's
tendencies to mentally "try out" ideas before acting on them, often called imagery or a gedanken
experiment.

Gedanken experiments are frequently used in science, both as an explanatory device
and as a mechanism for creative insight (e.g., Newton, 1729; Einstein & Infeld, 1938;
Dreistadt, 1968; Leatherdale, 1974; Miller, 1986). For example, Bohr and Heisenberg
explained the Heisenberg uncertainty principle by imagining an attempt to determine the
current state of an electron, that is, its position and velocity. First, obtain a powerful
microscope with illuminating light of wavelength smaller than the dimensions of the electron,
say γ-rays (10⁻¹¹ cm) (never mind for now if such a microscope is feasible). Now, imagining
the actual observation, we suddenly get an unanticipated effect - at this wavelength, the
illuminating light bombards the electron, knocking it out of view. Attempting to view
the electron has changed its initial velocity by an unpredictable amount (Hanson, 1968,
pg. 137).

A revolutionary gedanken experiment was conceived by Einstein in 1895 which laid
the foundation for his development of the special theory of relativity. Einstein imagined
an experimenter traveling at the speed of light. Physics at that time dictated that the
experimenter should be able to follow a light wave, and hence view it as behaving like a
standing wave. However, according to Einstein's intuition, the laws of optics should be
independent of the observer's motion. His thought experiment represented an important
paradox.

However, gedanken experiments are not limited to scientific investigation. People have
been known to invoke mental simulations for interpretation, prediction, or analysis of
conjectures in everyday reasoning about the physical world (e.g., Waltz, 1981; Gentner &
Stevens, 1983). How often have we observed a person make a quick guess to explain
something, think for a short time, and then declare "No, that doesn't make sense after
all."? For example, Williams et al. (1983) describe a subject explaining the operation of a
heat exchanger. The subject would mentally simulate what transitions should take place.
Conflicting predictions, or predictions that could not be justified, triggered a new round of
model development. Collins & Gentner (1987) found similar results in response to the
question "How does evaporation affect water temperature?". Their subject initially guessed that
it doesn't. Then, the subject imagined that the water molecules able to leave the surface
of the water must be the most energetic, thus lowering the average temperature. Finally,
the subject imagined the surface of a lake being warmed by the sun, with the deeper layers
at a lower temperature. As successive layers evaporated, the level would drop, new layers
would be warmed, and the average temperature would increase.

Chapter 7



Theory Revision 

When a proposed hypothesis is found to be inadequate during verification, that hypothesis
must be revised before it may be accepted. Since analogy can produce flawed or incomplete
inferences, any general model of analogical learning must include a complementary model
of hypothesis repair.

Most accounts of theory formation and revision have failed to depict the experiential,
integrated nature of theory development. They focus solely on first-principles analysis of
failed theories (e.g., Dietterich & Buchanan, 1983; Rose & Langley, 1986; Rajamoney, 1988a).
But why should the generation of revision hypotheses be fundamentally different from the
generation of explanatory hypotheses in the first place? Past experiences should still be
examined to see if any known phenomena might explain the anomaly. Furthermore, the
additional information available during revision may enable use of hypothesis generation
techniques that were inapplicable before.

PHINEAS cannot interact with the world. Instead, it uses a knowledge-intensive, heuristic
approach to theory revision. This chapter discusses three techniques to analyze anomalies
and propose plausible revisions. A first principles analysis provides a strong foundation
and exhaustive source of hypotheses. Precedent-guided revision uses knowledge of analogous
phenomena to provide a focused, ordered set of hypotheses. Finally, difference-based
reasoning examines prior successes to see what is different about the current situation that
may provide empirical justification for proposed hypotheses.

This chapter begins by overviewing the revision process, showing how the three revision
techniques combine to provide a focused source of hypotheses. It then describes each
revision technique in turn. They are only partially implemented, so the reader should assume
everything described in this chapter is unimplemented unless explicitly stated otherwise.
The chapter closes with an example and a perspective look at how this work relates to the
general theory revision problem.




7.1 Overview



A newly proposed hypothesis may be flawed in several ways. It may interact with prior 
beliefs, leading to various kinds of inconsistency. It may only provide a partial explanation, 
thus requiring further interpretation. The hypothesis may be overzealous in that it predicts 
behaviors that did not occur. Finally, it may suffer from a generalization flaw, in which
the hypothesis is primarily correct but the conditions on when it should and should not be 
applied are inaccurate. 

Repairing a faulty hypothesis is a two-stage process:

1. Credit assignment. Identify the portion(s) of the hypothesis responsible for the
incorrect conclusions, either explicitly or by identifying general classes of problems.

2. Repair. Modify the hypothesis so that offending conclusions will be retracted and
correct conclusions will be maintained.

A number of approaches to the credit assignment problem have been proposed. Typically,
they involve analyzing a trace of program execution or dependency structure to
isolate "points" of failure. For example, an operator leading down the wrong search path
may be identified (Mitchell et al., 1983) or a wrong conclusion traced to an inappropriately
applied rule or assumption (Smith et al., 1985; Rose & Langley, 1986).

There are three aspects of PHINEAS' task that lead to difficulties for these approaches.
First, there is no single answer or goal state, but rather a multi-state description that is
supposed to contain a subsequence of states matching an observed continuous behavior.
Hence, the same model or assumption might correctly predict one state while failing to
match another, all for the same scenario. Second, continuous quantities may have multiple
influences. If a quantity is predicted to be increasing but found to be constant, it is
not necessarily the case that the predicted positive influence is incorrect. There might
be an additional, unknown negative influence cancelling the positive influence. Finally,
a model will typically drive predictions about multiple components of a scenario (e.g.,
amount, pressure, and volume). Hence, a single faulty model may lead to non-local failures.
These problems are all analogous to the problems encountered in multiple fault
diagnosis (de Kleer & Williams, 1987).

An example is useful at this point to help clarify the revision task. Suppose in response
to the disappearing alcohol observation PHINEAS had learned the "evaporation" model
shown in Figure 7.1.¹ This model indicates that any liquid in a container will evaporate

¹This model is taken from (Falkenhainer & Rajamoney, 1988). It was learned by PHINEAS with the aid




(defProcess (PROCESS-3318 ?t-3309 ?t-3310 ?t-3311 ?t-3312)
  Individuals ((?t-3309 :conditions (Substance ?t-3309))
               (?t-3310 :conditions (Can-Contain ?t-3310 ?t-3309))
               (?t-3311 :conditions (Contained-Liquid ?t-3311)
                                    (Container-of ?t-3311 ?t-3310)
                                    (Substance-of ?t-3311 ?t-3309))
               (?t-3312 :conditions (Contained-Gas ?t-3312)
                                    (Container-of ?t-3312 ?t-3310)
                                    (Substance-of ?t-3312 ?t-3309)))
  QuantityConditions ((Greater-Than (A (Amount-of ?t-3311)) zero))
  Relations ((Quantity (Vaporization-Rate ?self))
             (Greater-Than (A (Vaporization-Rate ?self)) zero))
  Influences ((I- (Heat ?t-3311) (A (Vaporization-Rate ?self)))
              (Ctrans (Amount-of ?t-3311) (Amount-of ?t-3312)
                      (A (Vaporization-Rate ?self)))))

Figure 7.1: A flawed model of evaporation. This model indicates that any liquid in a 
container will evaporate until the container is empty. 



until the container is empty. Consider what happens when that same model is applied to a
new situation in which alcohol is placed in a closed container. The observed behavior and
the model's incorrect prediction are shown in Figure 7.2. What went wrong? Examination
of the justification structure shown in Figure 7.3, which states why the model predicts a
decrease in the alcohol, provides some clues. The derivative of the amount of alcohol is less
than zero because the alcohol has the quantity amount-of, process PROCESS-3318 is active
as the process instance PI0, and its vaporization rate is greater than zero. Further
examination reveals that PI0 is active because it "exists" (i.e., instantiates on known individuals)
and the quantity condition "amount of alcohol greater than zero" is true. Note that there
are many places where a justification might be changed to defeat the flawed belief. The
first simplification made is to reason at the level of processes, rather than about each
individual belief. Thus, the reasons for believing Active(PI0) are most relevant. What
changes will remove the erroneous belief? One possibility is to add a new quantity
condition, for example that the amount of alcohol must be greater than some lower limit point
l(alcohol1). Alternatively, the process' stated effect on the amount of alcohol could be

of Rajamoney's directed experimentation system (ADEPT). PHINEAS is unable to develop this exact model
(it was derived from boiling) when operating on its own, but the model has useful characteristics for this
discussion.

Figure 7.2: Observed behavior of alcohol in a closed container and the behavior predicted
by the flawed evaporation model.



Figure 7.3: Justification structure produced by PROCESS-3318 indicating why the derivative
of the alcohol's amount should be negative. Nodes in italic indicate assumptions made by
QPE during envisioning.



removed (the Ctrans expression). Caution is needed, for repairing an erroneous prediction 
in one envisionment state may reverse a correct prediction in another envisionment state. 

Due to these complexities, rather than examine the underlying beliefs behind each
incorrect parameter prediction one at a time, global anomalies are classified into failure
categories. This categorization is then used by the revision processes to determine possible
revisions. In describing the alternate categories, the following simplification is made:

• Process simplification: Only revision of flawed process models is considered.
Individual rules and entity definitions are not subject to change.

A process is syntactically equivalent to a schema definition. It specifies constraints
on the objects it may apply to, a set of conditioning relations that state when it may
apply (i.e., the process' preconditions and quantity conditions), and a set of consequent
effects on the world. A process is considered active when it is applied in a given situation
and inactive when it does not apply. For example, a process' preconditions and quantity
conditions together indicate when it is active and inactive. Note that the same process
may be both active and inactive in the same envisionment, but never in the same state of
that envisionment.

Two assumptions are made in the revision process: 

• Single influence assumption: There are no influences on an observed quantity
that is constant.

By this assumption, the case of a quantity being constant due to equal and opposite
influences on it is not considered.

• Unit influence-chain assumption: If the hypothesis that Q1 is influencing Q2 is
being proposed, but is currently not believed, the chain of influences from Q1 to
Q2 is of length one.

By this assumption, the case of hidden influencers is not considered. This prevents
proposing an influence from Q1 to Q2 of the form Q2 ∝ ... Qi ... ∝ Q1, where each Qi is an unknown,
hypothesized quantity.

Due to the limited resolution of qualitative models, qualitative simulators are designed 
to branch in times of ambiguity. A given model may produce a large number of scenario 
predictions, corresponding to different run-time assumptions about quantity relationships, 
object existence, and influence ambiguities. A revision problem that will not have to 
be addressed is revising these types of assumptions, since the qualitative simulator will 
automatically branch on these ambiguities. 

7.1.1 Combining Analysis with Experience Provides Focus 
In summary, revision proceeds as follows: 

1. Failure detection and isolation. Failures are detected when the verification process
classifies a hypothesis as inadequate. The registration indicates which states of the
behavior are anomalous.

2. Failure classification. The type of failure is classified (e.g., premature stop,
should cause, etc.).

3. Propose revisions. Potential revisions of the theory are hypothesized to enable it to
correctly explain the previously anomalous observation.

4. Select revision(s). A subset of the proposed revisions is chosen and revised theories
are formed. These theories must be reexamined for adequacy.

The task of proposing and selecting revision hypotheses is severely underconstrained.
Using a purely formal analysis of a prediction's causal structure, the number of possible
revisions is infinite (e.g., creating an endless transitive chain of new quantities and
proportionalities to hypothesize an influence on qi by qj). Even with the stated assumptions,
analysis of the causal structure alone will produce a large number of possible revisions for a
model of any complexity, making it extremely difficult to identify the "one true fix".
Additionally, there is no clear best selection criterion. Rose and Langley (1986) have proposed a
cost measure for ordering possible revisions, in which revisions to premises supporting the
fewest beliefs are preferred. However, they correctly observed that some hypotheses about
an anomaly seem more plausible than others, for which the cost measure does not account.²
I propose that this sense of plausibility arises from experience, both with behavior analogous
to a given anomaly and with prior applications of the theory under investigation. These
two forms of analogy may be used in tandem with a first-principles analysis to provide a
preferential ordering and empirical support for proposed revision hypotheses.
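
The cost measure attributed above to Rose and Langley (1986) can be illustrated with a small Python sketch. The dictionary layout, the premise labels, and the function name `order_by_cost` are assumptions made for this illustration only; they are not taken from PHINEAS or from Rose and Langley's system.

```python
def order_by_cost(support):
    """Sort premises so that those supporting the fewest beliefs come first.

    `support` maps each premise (a hypothetical label) to the set of beliefs
    that depend on it. Ties are broken alphabetically for determinism.
    """
    return sorted(support, key=lambda premise: (len(support[premise]), premise))


# Hypothetical dependency sets for the flawed evaporation model.
support = {
    "quantity-condition": {"Active(PI0)", "Decreasing(amount)"},
    "Ctrans-effect": {"Decreasing(amount)"},
    "individuals": {"Exists(PI0)", "Active(PI0)", "Decreasing(amount)"},
}
ranking = order_by_cost(support)
```

An ordering of this kind is purely structural: it counts dependents and knows nothing about which revision is physically plausible, which is the gap that experience is proposed to fill.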

The revision process is depicted in Figure 7.4. During the verification stage, anomalies
are detected through the registration; bsegs having no interpretation correspond to
physical states where the theory fails. The failure is classified according to a set of
failure categories. First principles analysis is then used to examine the behavior history, the
failure classification, and the flawed theoretical predictions to propose a list of possible
changes to the theory. This list is typically long and always unordered. Precedent-guided
revision also proposes revisions, but from a different perspective. It seeks analogues to the
current observation that display the same behavioral aspect causing the current problem.
It then determines how that aspect was explained in the analogue case and suggests
revisions that would produce a similar explanation. Revisions drawn from analogous situation
explanations have experiential corroboration and are preferred over revisions drawn solely
from an analysis of the situation in isolation. Difference-based reasoning seeks empirical
explanation for the change in behavior between the current anomalous case and a prior
observation successfully explained by the model under investigation. It identifies how the two
scenario descriptions differ and attempts to determine which quantities
might be affected by that change. Revisions concerning those quantities have empirical
corroboration, while revisions about other quantities are deemed less desirable.
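
One way to picture how the three sources combine is the following Python sketch, which reorders an unordered first-principles list by how much corroboration each proposal has. The text does not say how precedent-based and difference-based corroboration are weighed against each other, so counting each as one point is an assumption of this sketch, as are the function name and the example proposals.

```python
def rank_revisions(first_principles, precedent, difference_quantities):
    """Order first-principles proposals by their amount of corroboration.

    first_principles: list of (revision, quantity) pairs, in no particular order.
    precedent: set of revisions also suggested by an analogous case.
    difference_quantities: quantities implicated by the scenario differences.
    """
    def score(item):
        revision, quantity = item
        # Assumption: each kind of corroboration counts equally.
        return (revision in precedent) + (quantity in difference_quantities)

    # sorted() is stable, so equally corroborated proposals keep their input order.
    return sorted(first_principles, key=score, reverse=True)


# Hypothetical proposals for an anomaly involving the amount of a liquid.
proposals = [
    ("remove Ctrans effect", "amount"),
    ("add QC amount > limit", "amount"),
    ("add unknown precondition", None),
]
ranked = rank_revisions(proposals, {"add QC amount > limit"}, {"amount"})
```

Proposals with no corroboration are kept at the end of the list rather than discarded, matching the description of them as less desirable rather than impossible.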

²In diagnostic reasoning, this observation is addressed by use of probability measures on possible faults
(e.g., Buchanan & Shortliffe, 1984; de Kleer & Williams, 1987). It is not clear how Bayesian probability
measures for flaws in a model would be obtained.

Figure 7.4: Overview of the proposed PHINEAS revision process.



7.1.2 Categorizing the Flaw

The NoInterpret(bi) statements of a registration between model and observation indicate
specific behavioral segments that were considered anomalous. Failures are detected and
isolated by examining the individual bseg disagreements, or considering the behavior of
a sequence of uninterpreted bsegs. Points of disagreement are then classified according
to the categories listed in Table 7.1. These categories are similar in intent to the
taxonomy discussed in (Smith et al., 1985). However, they primarily assumed correctness of the
underlying domain theory and examined failures due to violated assumptions.
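
As a small illustration of isolating sequences of uninterpreted bsegs, the Python sketch below groups consecutive anomalous segments into runs. Representing a registration as a list of booleans is an assumption made here for brevity; the function name is hypothetical.

```python
from itertools import groupby


def anomalous_runs(no_interpret):
    """Group consecutive uninterpreted bsegs into runs.

    `no_interpret` holds one boolean per bseg in temporal order, True where
    NoInterpret holds. Returns inclusive (start, end) index pairs.
    """
    runs, index = [], 0
    for flag, group in groupby(no_interpret):
        length = len(list(group))
        if flag:
            runs.append((index, index + length - 1))
        index += length
    return runs


runs = anomalous_runs([False, True, True, False, True])
```

A run of length one corresponds to an individual bseg disagreement, while longer runs are candidates for the behavioral gap category.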

Anomalies may be of three general types: conditioning relation, effects relation, or
behavioral gap. Each of these distinctions has an associated set of revision distinctions.

7.1.2.1 Conditioning relation 

The conditioning relations of a process (i.e., preconditions and quantity conditions) indicate
when the process is active and inactive. Problems related to incorrect conditioning relations
appear in two forms: a process is not prevented from being active when it shouldn't be
(Prevent), or excessive constraint is causing a process to be inactive when it shouldn't be
(Cause).

• Conditioning relation. If knowledge is applied when it shouldn't be, or not applied when
it should be, then the conditioning relations describing applicability must be revised.
This corresponds to the following pair of possibilities:

1. Prevent: A theory was applied when it should not have been. This may be
further classified into:

(a) Stopped: The theory says it should be active, the behavioral state indicates
inactive, and the theory was correctly active in some non-empty set of states.
Having time in the representation enables two further distinctions:

i. Premature-stop: Stopped prior to, but on the way to, stopping normally.

ii. Chance-stop: An unanticipated ending.

(b) Blocked: The behavior indicates always inactive, while the theory sometimes
or always indicates active.

2. Cause: A theory was not applied when it should have been. This may be further
classified into:

(a) Kept-going: The behavior kept going in all the same directions, yet the
theory driving it went inactive.

(b) Is-going: The theory is always inactive, while the behavior indicates that
it should be active at times.

• Effects relation. When a packet of knowledge is properly applied, it may still be
inadequate if its believed effects are in error. There are two possibilities:

1. Should-cause: Additional behavior was observed that wasn't accounted for.

2. Shouldnt-cause: Additional behavior was predicted that didn't occur.

• Behavioral gap. If a sub-sequence of behavioral states is unexplained, perhaps with
surrounding states properly explained, there might be more wrong than simply a
precondition or effect relation of a theory. Additional theories may be required to
incrementally develop a complete picture through multiple models. Alternatively, the
existing theory might be faulty beyond repair and need to be replaced.

Table 7.1: Categories of potential flaws.

The Prevent condition holds for an active process instance if every quantity influenced 
by that process (1) conflicts in the given bseg and (2) was observed to be constant during 
that bseg. The Prevent condition may be further specialized. If the process instance 
is correctly active in at least one bseg, then the problem is the more specific Stopped 
condition. Otherwise it is a Blocked condition. A Stopped condition is classified as a 
Premature-stop if the process stopped prior to, but on the way to, stopping normally. It 
is classified as a Chance-stop if no future stop was anticipated, that is, it was not heading 
for a limit point. The evaporation anomaly described above was of type Premature-stop.

The Cause condition holds for an inactive process instance if every quantity influenced
by that process (1) conflicts in the given bseg and (2) was observed to be changing during
that bseg. The Cause condition is further specialized into two subcategories. It is a
Kept-Going condition if the process was active during the previous bseg, the previous bseg
was correctly interpreted, and the derivatives of all quantities influenced by the process
instance are the same in both the current and previous bsegs. It is an Is-Going condition if
the process is never active in any bseg of the observation, while the quantities it influences
are observed to be changing in some bsegs for which NoInterpret is true.
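
The Prevent and Cause definitions above can be read as a decision procedure. The Python sketch below is one such reading, under several assumptions: a bseg is reduced to a record of qualitative derivative signs (-1, 0, 1), "correctly active" is approximated as "believed active in an interpreted bseg", and whether the process was heading for a limit point is supplied by the caller. The class and function names are hypothetical and do not come from PHINEAS.

```python
from dataclasses import dataclass


@dataclass
class Bseg:
    active: bool       # the theory believes the process instance is active
    interpreted: bool  # False where NoInterpret holds
    observed: dict     # influenced quantity -> observed derivative sign
    predicted: dict    # influenced quantity -> predicted derivative sign


def every_quantity_conflicts(b, constant):
    """True if every influenced quantity conflicts and is constant (or changing)."""
    return bool(b.observed) and all(
        b.observed[q] != b.predicted[q] and (b.observed[q] == 0) == constant
        for q in b.observed
    )


def classify_conditioning(bsegs, i, heading_for_limit=False):
    b = bsegs[i]
    if b.active and every_quantity_conflicts(b, constant=True):       # Prevent
        if any(x.active and x.interpreted for x in bsegs):            # Stopped
            return "Premature-stop" if heading_for_limit else "Chance-stop"
        return "Blocked"
    if not b.active and every_quantity_conflicts(b, constant=False):  # Cause
        prev = bsegs[i - 1] if i > 0 else None
        if prev and prev.active and prev.interpreted and prev.observed == b.observed:
            return "Kept-Going"
        if not any(x.active for x in bsegs):
            return "Is-Going"
        return "Cause"
    return None


# The closed-container case: the amount decreases, then unexpectedly holds steady.
history = [
    Bseg(True, True, {"amount": -1}, {"amount": -1}),
    Bseg(True, False, {"amount": 0}, {"amount": -1}),
]
label = classify_conditioning(history, 1, heading_for_limit=True)
```

Under these assumptions the second bseg of the closed-container history is labeled Premature-stop, in agreement with the classification given in the text for the evaporation anomaly.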

7.1.2.2 Effects relation 

The effects relations of a process indicate how it influences continuous quantities when
active. There are two types of influence forms. Qprop(qi, qj) indicates that qi is qualitatively
proportional to qj. In QP theory this is called an indirect influence. All else being equal,
qi will increase if qj increases and decrease if qj decreases. I+(qi, qj) indicates that the
derivative of qi is equal to a sum of terms, ... + qj + ..., in which qj appears as a positive
term. In QP theory this is called a direct influence.
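
A minimal Python sketch of how these two influence forms behave at the level of signs follows. It is an illustration of the qualitative arithmetic only, not QPE's influence resolution; the function names are assumptions, and each entry passed to `resolve_direct` stands for the sign of one term contributed to the derivative.

```python
def resolve_direct(signs):
    """Combine the signs of all direct-influence terms acting on one quantity.

    Returns 1, -1, or 0 when the sign of the sum is determined, and None when
    opposing terms make the qualitative sum ambiguous.
    """
    nonzero = {s for s in signs if s != 0}
    if not nonzero:
        return 0
    if len(nonzero) == 1:
        return nonzero.pop()
    return None


def propagate_qprop(sign_qj, inverse=False):
    """Sign of the change in qi given Qprop(qi, qj), all else being equal."""
    return -sign_qj if inverse else sign_qj
```

The ambiguous case returned as None is the situation in which a qualitative simulator must branch, and it is also why a quantity observed to be constant does not by itself refute a believed positive influence.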

Problems related to incorrect effects relations appear in two forms: If processes are
believed active for an uninterpreted bseg and all of the quantities they influence are
changing in a manner consistent with those influences, while other uninfluenced quantities are
observed to be changing, then the condition Should-Cause holds for that bseg. If there are
quantities changing in an uninterpreted bseg in accordance with influences from processes
believed to be active in that bseg, while other quantities influenced by those processes are
constant in the given bseg, then the condition Shouldnt-Cause holds for that bseg.

For example, if a proposed model of osmosis (solvent flowing through a membrane)
incorrectly states that solution is flowing, rather than solvent alone, it will correctly predict
the change in solution amount but incorrectly predict that the solution's concentration will
remain constant. This is a Should-cause anomaly, since the osmosis model should cause
a change in the concentration quantity.

Note that these conditions are sensitive to the consistency of adjacent bsegs. If a
quantity is consistently predicted to be influenced by an active process in one bseg, then
Shouldnt-Cause will not be true of the next bseg if that quantity is constant but predicted
to be changing during that next bseg.
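
The two effects-relation conditions can be sketched in Python as follows. The sketch assumes the processes are believed active in an uninterpreted bseg, represents behavior as derivative signs, and deliberately ignores the adjacent-bseg sensitivity just described; the function name and dictionary layout are hypothetical.

```python
def classify_effects(observed, predicted):
    """Classify one uninterpreted bseg in which the processes are believed active.

    observed:  quantity -> observed derivative sign (-1, 0, 1)
    predicted: quantity -> sign predicted by the active influences
               (0 or absent means the quantity is not influenced)
    """
    influenced = [q for q, s in predicted.items() if s != 0]
    as_predicted = [q for q in influenced if observed.get(q, 0) == predicted[q]]
    constant = [q for q in influenced if observed.get(q, 0) == 0]
    unexplained = [q for q, s in observed.items()
                   if s != 0 and predicted.get(q, 0) == 0]

    if influenced and len(as_predicted) == len(influenced) and unexplained:
        return "Should-Cause"
    if as_predicted and constant:
        return "Shouldnt-Cause"
    return None


# The osmosis example: amount changes as predicted, concentration changes unexpectedly.
osmosis = classify_effects(
    observed={"solution-amount": 1, "concentration": -1},
    predicted={"solution-amount": 1, "concentration": 0},
)
```

With these inputs the osmosis case comes out as Should-Cause, matching the example in the text.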

7.1.2.3 Behavioral gap 

If an anomaly cannot be classified as a specific problem with conditioning or effects
relations, it is classified as a behavioral gap. A behavioral gap indicates there is a severe
problem with the model. For example, if there is a sequence of behavioral segments for
which NoInterpret is true, it might be the case that the model is incomplete and additional
explanation hypotheses are needed. This case is currently not addressed.

7.2 First Principles Analysis 

A first principles approach to revision is one which depends strongly on the underlying 
domain theory, analyzing the causal structure supporting a flawed set of beliefs and using 
only weak heuristics to guide search. It is a powerful technique in that the reasons for a
flawed belief may be identified explicitly. Previous sections have described how anomalous 
bsegs are detected and categorized. This section describes how those classifications may be 
used to propose revisions to a flawed process description. 

There are five types of revision hypotheses, which either Add, Remove, or Change ele- 
ments of a flawed theory: 

• Quantity condition modification. Add or remove a quantity condition for a specified
process (e.g., Greater-than[q1, q2]).

Propose(Add[QC(inequality[quantity1, quantity2], process), theory])

• Precondition modification. Add or remove a precondition for a specified process (e.g., 
IslrmM-Qpmnifiuid-path)). 

Propose(Add[PC(atomic-sentence, process), theory])

• Effect modification. Add or remove an effect relation of a specified process (e.g.,
Qprop(qi, qj)).

Propose(Add[Effect(influence, process), theory])

• Participants modification. Change the individuals taking part in a specified process.

Propose(Change[Individual(obj_old, process), Individual(obj_new, process), theory])

• Merge models. Recursively invoke the normal interpretation process to augment the
existing theory and incrementally develop a complete global interpretation. For
example, fully explaining a single observation possessing both thermal and chemical
aspects may require multiple analogies.

At this time, only modification of conditioning relations, effects relations, and a process'
participatory objects will be considered. Full recursive invocation of PHINEAS, leading to
integration of multiple models of the observation, is an important problem, but beyond the
scope of this thesis.

The ability to examine the behavior history is central to this process. If a model is
inconsistent with some state, what the behavior and model were doing in the previous
state, as well as in the next state, will have a lot to say about the problem. Static analysis
of a single anomalous state lacks important contextual information that may rule out many
revision hypotheses. For example, suppose a quantity is decreasing, which is consistent with
the model, then suddenly stops while the model states it should continue to decrease. The
quantity may have reached an important limit point. From the prior history, it is clear that
if such a limit point was reached, it was approached from above, not below. Thus, QC[q >
l(q)] is a potential new quantity condition, while QC[q < l(q)] is not.

Revision hypotheses are proposed by running a set of rules that examine the failure
category, the observed behavior, and the theory's predictions. The rules described below
are fully implemented, but the current set is incomplete. Therefore, a representative sample
will be described.³ In describing these rules, a number of quantified variables are used. th
is the theory under investigation, pi is a process instance that is part of the theory, s, s1,
s2, ... are behavioral segments of the observation, and q denotes a quantity.

7.2.1 Conditioning relations 

Revision of a process' conditioning relations is sanctioned when an anomaly is associated
with one of the Prevent or Cause categories. The revision proposal rules examine the
predicted activity of the process and compare that to its apparent activity as indicated by
the observation.

³Some of the rules described were originally developed by Shankar Rajamoney. Rajamoney (1988b) is
examining the same revision problem from a different perspective.

7.2.1.1 Prevent category



A Prevent condition occurs when a process is believed active when the behavior indicates
it is inactive. If the behavior indicates the process has transitioned from active to inactive,
or is inactive and will become active at some later time, then the process is in a temporary
stopped condition.

The following rule addresses the case of a process failing to transition from active to
inactive.

Rule 1 (Approaching from above) If a quantity is decreasing and then prematurely
stops, it may have reached an important limit point, limit(q), that the quantity must be
greater than.

Behavior-Indicates[th, During(Premature-Stop(pi), s2)] ∧
Process-Influences(pi, q) ∧
Observed[During(Decreasing(q), s1)] ∧ Meets(s1, s2)
⇒ Propose(Add[QC(Greater-than[q, limit(q)], pi), th])

The rule applies to a situation where pi, which influences quantity q, prematurely
stopped and q was decreasing in the previous bseg. It proposes adding a quantity condition
that requires q to be greater than some limit value for that quantity (yet to be determined).
A reciprocal rule adds a Less-than quantity condition if the quantity was increasing in the
previous state.
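
Rule 1 and its reciprocal differ only in the direction from which the limit was approached, which the following Python sketch makes explicit. The tuple encoding of a proposal and the function name are assumptions of this sketch, not PHINEAS' internal representation, and the premature-stop classification is taken as already established.

```python
def propose_limit_condition(quantity, previous_sign):
    """Propose a quantity condition for a process that prematurely stopped.

    previous_sign is the observed derivative sign of `quantity` in the bseg
    that meets the anomalous one: -1 for decreasing, 1 for increasing.
    """
    if previous_sign < 0:
        relation = "Greater-than"  # limit approached from above (Rule 1)
    elif previous_sign > 0:
        relation = "Less-than"     # limit approached from below (reciprocal rule)
    else:
        return None                # no direction of approach, so neither rule fires
    return ("Add", "QC", relation, quantity, f"limit({quantity})")


proposal = propose_limit_condition("Amount-of(alcohol)", -1)
```

The limit value itself stays symbolic, reflecting that the rule proposes the existence of a limit point without determining its value.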

A specialization of this rule accounts for two quantities of the same type approaching
each other toward equality:

Rule 2 (Dual approach) If a quantity decreasing for one object and increasing for another
object prematurely stop at the same instant they reach equality, it may be that the
decreasing quantity must be greater than the increasing quantity for the process to be active.

Behavior-Indicates[th, During(Premature-Stop(pi), s2)] ∧
Process-Influences(pi, q(obj1)) ∧ Process-Influences(pi, q(obj2)) ∧
Observed[During(Decreasing(q(obj1)), s1)] ∧
Observed[During(Increasing(q(obj2)), s1)] ∧ Meets(s1, s2)
⇒ Propose(Add[QC(Greater-than[q(obj1), q(obj2)], pi), th])

A process' activity may also be conditioned on an equality relationship between two 
quantities: 

Rule 3 (Transition from equality) If an equality transitions to inequality at the same
moment a process prematurely stops, that equality may be a prerequisite for the process'
activity.

Behavior-Indicates[th, During(Premature-Stop(pi), s2)] ∧
Observed[During(Constant(q1), s1)] ∧ Observed[During(Equal-to(q1, q2), s1)] ∧
Observed[During(¬Equal-to(q1, q2), s2)] ∧ Meets(s1, s2)
⇒ Propose(Add[QC(Equal-to[q1, q2], pi), th])

When the behavior indicates a process should never be active, while the process is 
believed to be active during periods of the observation (i.e., Blocked), a precondition may 
be absent. Preconditions state important physical conditions whose change cannot be 
predicted in terms of changes to continuous quantities (e.g., a switch being on or off). 

Rule 4 (Missing precondition) If the behavior indicates a process is blocked, an unknown
condition, c, for the process' activity may not hold in the scenario.

Behavior-Indicates[th, Blocked(pi)]
⇒ Propose(Add[PC(c(pi), pi), th])

7.2.1.2 Cause category 

A Cause failure occurs when a process is believed inactive when the behavior indicates it
is active. If the process incorrectly transitions from active to inactive, then the anomaly is
a kept-going condition.

Rule 5 (Unnecessary greater-than condition) If a process transitions from active to
inactive due to a greater-than quantity condition, while the behavior indicates a kept-going
condition, that quantity condition may be unnecessary.

Behavior-Indicates[th, During(Kept-Going(pi), s2)] ∧
QC(Greater-than[q1, q2], pi) ∧
Observed[During(Greater-than(q1, q2), s1)] ∧
Observed[During(Equal-to(q1, q2), s2)] ∧ Meets(s1, s2)
⇒ Propose(Remove[QC(Greater-than[q1, q2], pi), th])

A reciprocal rule removes a Less-than quantity condition if it was the reason for a pro- 
cess stopping in a kept-going state. Two other rules remove equality quantity conditions 
whose change to inequality was the reason for a process stopping in a kept-going state. 

If a process description contains an unnecessary precondition, failure of that precondi- 
tion will lead to occasions when the process is incorrectly classified as inactive throughout 
a given scenario (the is-going category): 

Rule 6 (Spurious precondition) If the behavior indicates a process is active, while the
process is believed to be inactive throughout the scenario (i.e., is-going), one of the process'
preconditions may be unnecessary.

Behavior-Indicates[th, Is-Going(pi)] ∧ PC(c, pi)
⇒ Propose(Remove[PC(c, pi), th])

7.2.2 Effects relations



A should-cause condition occurs when a process correctly predicts the behavior of all
quantities it influences, yet other quantities are observed to be changing that the process
does not influence.

Rule 7 (Missing proportionality) If an uninfluenced quantity is observed to be changing,
it may be proportional to an influenced quantity that is changing.

Behavior-Indicates[th, Should-cause(pi)] ∧
Process-Influences(pi, q1) ∧ ¬Process-Influences(pi, q2) ∧
During(Active(pi), s) ∧
Observed[During(Increasing(q1), s)] ∧ Observed[During(Increasing(q2), s)]
⇒ Propose(Add[Effect(Qprop(q2, q1), pi), th])

Three other rules treat the various permutations possible (e.g., q1 increasing and q2
decreasing leads to a Qprop-(q2, q1) (inversely proportional) being proposed).
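
Rule 7 and its three permutations reduce to a comparison of observed signs, as the Python sketch below shows. It assumes the should-cause classification and the influence relationships have already been established; the function name and the tuple encoding of a proposal are hypothetical.

```python
def propose_proportionality(q1, sign_q1, q2, sign_q2):
    """Link an uninfluenced, changing q2 to an influenced, changing q1.

    Signs are observed derivative signs (-1, 0, 1) during a bseg in which the
    process is active. Returns a proposal tuple, or None if either is constant.
    """
    if sign_q1 == 0 or sign_q2 == 0:
        return None
    relation = "Qprop" if sign_q1 == sign_q2 else "Qprop-"
    return ("Add", "Effect", relation, q2, q1)


same = propose_proportionality("q1", 1, "q2", 1)
opposite = propose_proportionality("q1", 1, "q2", -1)
```

Matching signs yield a positive proportionality and opposing signs an inverse one, so the four rules collapse into a single sign test.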

A shouldnt-cause condition occurs when a process correctly predicts the behavior of
some of the quantities it influences, while the other quantities it influences are observed to
be constant.

Rule 8 (Spurious proportionality) If an influenced quantity is observed to be constant
while the quantity it is proportional to is changing, the belief in the proportionality may be
incorrect.

Behavior-Indicates[th, Shouldnt-cause(pi)] ∧
Effect(Qprop(q1, q2), pi) ∧ During(Active(pi), s) ∧
Observed[During(Constant(q1), s)] ∧ Observed[During(¬Constant(q2), s)]
⇒ Propose(Remove[Effect(Qprop(q1, q2), pi), th])

7.2.3 Participants 

Some should-cause or shouldnt-cause conditions are due to the process describing the
behavior of the wrong set of individuals. For example, when two objects or quantities are
closely related (e.g., a solution and its solvent), a misaligned analogical mapping may be
formed which places the wrong target item in correspondence with a given base item. The
problem might be detected when the target item fails to support the required predictions.
Only the decomposition rule has been used:

Rule 9 (Decomposition) If the amount of a mixture is correctly believed to be changing,
but the belief that the relative proportions of the mixture's constituents are constant is
incorrect, it may be that the amount of a constituent alone is changing, rather than the
mixture as a whole.

Behavior-Indicates[th, Should-cause(pi)] ∧
Individual(m, pi) ∧ Mixture(m) ∧ Component-of(m, c) ∧
Process-Influences(pi, Amount-of(m)) ∧
Observed[During(Decreasing(Amount-of(m)), s)] ∧
Prediction[th, During(Decreasing(Amount-of(m)), s)] ∧
Observed[During(Decreasing(Percentage-of(c)), s)] ∧
Prediction[th, During(Constant(Percentage-of(m)), s)]
⇒ Propose(Change[Individual(m, pi), Individual(c, pi), th])

7.2.4 Closed Container Example 

Let us return to the situation described at the beginning of this chapter, in which
alcohol in a closed container violated PHINEAS' model of evaporation. The first step in
revising the model is to classify the apparent anomaly. In this example, the problem is a
premature-stop: the model predicted that the amount of alcohol would decrease until the
container was empty; the observation showed the alcohol's decrease stopped well before its
amount reached zero.

The second step is to run the revision rules. For a premature-stop, this requires
consideration of the various quantities' behavior immediately prior to the stop. The complete
set of proposed revisions, in conjunction with the behavior that suggested each revision, is
shown in the following table:

Derivative                    Value   Proposed Additions
Ds[Amount-of(alcohol1)]        -1     QC(Greater-than[Amount-of(alcohol1), limit(alcohol1)])
Ds[Amount-of(sk-steam1-1)]      1     QC(Less-than[Amount-of(sk-steam1-1), limit(sk-steam1-1)])
                                      QC(Greater-than[Amount-of(alcohol1), Amount-of(sk-steam1-1)])
Ds[Pressure(alcohol1)]         -1     QC(Greater-than[Pressure(alcohol1), limit(alcohol1)])
Ds[Pressure(sk-steam1-1)]       1     QC(Less-than[Pressure(sk-steam1-1), limit(sk-steam1-1)])
Ds[Temperature(alcohol1)]      -1     QC(Greater-than[Temperature(alcohol1), limit(alcohol1)])
Ds[Heat(alcohol1)]             -1     QC(Greater-than[Heat(alcohol1), limit(alcohol1)])

Each of the seven proposed revisions will enable PROCESS-3318 to consistently explain
the observed behavior. The question remaining is how to choose which revision to make.
Notice how some revisions seem more implausible than others (e.g.,
QC(Greater-than[Amount-of(alcohol1), limit(alcohol1)])). This is the topic of the next
two sections.
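
The limit-based entries in the table follow mechanically from each quantity's derivative sign just before the stop: a decreasing quantity suggests it was approaching some limit from above, an increasing one from below. A minimal sketch of that step follows; the function name is hypothetical, and the sketch deliberately omits the pairwise condition relating Amount-of(alcohol1) to Amount-of(sk-steam1-1).

```python
def propose_limit_conditions(derivative_signs):
    """Sketch: turn derivative signs observed just before a premature stop
    into candidate quantity conditions over an unknown (skolem) limit.

    derivative_signs: dict mapping a quantity such as "Amount-of(alcohol1)"
                      to its sign (-1 decreasing, 0 constant, 1 increasing).
    """
    proposals = []
    for quantity, sign in derivative_signs.items():
        # The entity is the text between the parentheses, e.g. "alcohol1".
        entity = quantity[quantity.index("(") + 1:-1]
        if sign < 0:
            proposals.append(f"QC(Greater-than[{quantity}, limit({entity})])")
        elif sign > 0:
            proposals.append(f"QC(Less-than[{quantity}, limit({entity})])")
    return proposals
```

Constant quantities produce no proposal, since nothing about them changed as the stop approached.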



7.3 Precedent-Guided Revision

While first-principles analysis is able to identify the set of possible revisions for a flawed
theory, it has two important limitations. First, it offers no preferential ordering on a set of
revision hypotheses. Second, many of the revisions it proposes contain unknown quantities,
expressed as skolem functions over an existing quantity. For example, the approaching from
above rule proposes the quantity condition Greater-than[q, limit(q)].

Precedent-guided revision addresses both limitations by recognizing that experience is an
important factor in selecting among alternative revisions. First, it ranks revision hypotheses
according to their experiential plausibility by seeking understood behavior that is analogous
to the current anomaly. Second, adapting explanations of analogous behavior often attaches
known concepts to the unknown quantities proposed by the first-principles analysis. For
example, a quantity condition mapped from a prior explanation will have a known quantity
in place of limit(q) in Greater-than[q, limit(q)].

The procedure is an adaptation of PHINEAS' normal explanation process:

1. Identify the aspect of the current behavior that is anomalous. 

2. Access potential analogues, requiring that the behavioral mapping contain the
anomalous aspect of the current behavior. This uses PHINEAS' standard access mechanism,
with a candidate analogue rejected if the behavioral match fails to contain a
correspondence for the anomalous aspect of the current situation.

3. Identify what explained the relevant aspect of the analogue behavior. Once the
relevant aspect of the analogue behavior is detected during access, the underlying
explanation for that component of its behavior may be retrieved. This uses the same
explanation retrieval mechanism used during PHINEAS' normal mapping process.

4. Map those explanation components to the current situation. 

5. Propose the mapped elements as plausible revisions to the flawed theory under inves- 
tigation. 

The only component of this process that has not been described in previous chapters is
the first - identify the anomalous aspect. This information is provided by the observation,
registration and failure categorization. The process is best demonstrated by example.
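
The five steps above can be sketched as a filter over remembered analogues. This is an illustration only: the data layout is hypothetical, and the analogical mapping of step 4 (done by SME in PHINEAS) is assumed to have been computed already, so each stored explanation is expressed in terms of the current situation.

```python
def precedent_guided_revisions(anomalous_aspect, analogues):
    """Sketch of precedent-guided revision.

    anomalous_aspect: label for the anomalous part of the behavior (step 1).
    analogues: list of dicts with keys
        "name"         - analogue name,
        "aspects"      - set of behavioral aspects the analogue exhibits,
        "explanations" - dict aspect -> condition, assumed already mapped
                         into the current domain (steps 3 and 4).
    Returns (analogue name, proposed condition) pairs (step 5).
    """
    proposals = []
    for analogue in analogues:
        if anomalous_aspect not in analogue["aspects"]:
            continue  # step 2: no correspondence for the anomaly, so reject
        condition = analogue["explanations"].get(anomalous_aspect)
        if condition is not None:
            proposals.append((analogue["name"], condition))
    return proposals
```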

7.3.1 Closed Container Example

Recall that the closed container anomaly for PROCESS-3318 was a premature-stop failure.
In terms of the specific situation, this indicates that the four liquid alcohol1 quantities
(Amount-of, Pressure, Temperature, and Heat) were decreasing, the two gas sk-steam1-1
quantities (Amount-of and Pressure) were increasing, and the behavior stopped prior to
the amount of liquid reaching zero. Thus, an understood behavior is sought which must
provide a "decreasing and transitioning to stopped prior to reaching zero" analogue for
the amount of alcohol1. Recall from Chapter 4 that PHINEAS knows of four analogues
to the disappearing alcohol scenario: water flowing from a leaky cup, water flow between
two containers, boiling, and dissolving. Not all of these analogues display the requisite
behavioral aspect. The leaky cup scenario stops when the amount of water reaches zero.
This is precisely the kind of behavior that is not relevant to explaining the current failure.
The same reasoning rejects boiling as a relevant analogue. Mapping the liquid flow and
dissolving domain theories suggests two possible revisions:

Liquid Flow: QC(Greater-than[Pressure(alcohol1), Pressure(sk-steam1-1)])
Dissolving:  QC(Less-than[Amount-of(sk-steam1-1), Saturation-Point(sk-steam1-1)])

The pressure inequality condition mapped from the liquid flow situation is inconsistent
with the observation. Since the pressure at the bottom of the liquid alcohol is always
greater than the pressure of the gas alcohol, a transition to equality never occurred. The
saturation point condition suggested by dissolving replaces a less precise version suggested
earlier:

QC(Less-than[Amount-of(sk-steam1-1), limit(sk-steam1-1)])

7.4 Difference-Based Reasoning

The preceding two sections described formal and experiential grounds for generating and
ranking revision hypotheses. One other important clue is often possible: can any of the
novel features about the situation itself be used to explain the anomaly? Specifically, if the
theory under revision has been used successfully in the past, what is novel about the current
situation that could cause the observed change in behavior? By identifying and explaining
the effects of differences between situations in which a theory is applied, difference-based
reasoning (DBR) provides empirical evidence for what revision hypotheses to consider.

DBR is a general technique designed to facilitate the resolution of expectation failure.
Falkenhainer (1988c) describes its use in theory formation, diagnosis, and planning
failure explanation. It is relevant to situations in which an expectation was violated and an
instance of the desired performance is available. In the context of PHINEAS, the
expectation failure corresponds to violation of a theory and the instance of desired performance
corresponds to a prior, successful application of the theory.

Here we examine a very special interpretation of DBR: if the theory has been used
successfully in the past and a difference may be identified between the current and
previous situations, determine what quantities that difference has the potential to affect. Any
revision hypotheses that do not mention these quantities should be discounted.

In PHINEAS, DBR consists of four stages:



1. Retrieve a situation description successfully explained by the theory under investi- 
gation. Each process description lists the situations it has explained, just as each 
situation stores pointers to the domain theory that explained it. 

2. Compute Δ, the set of differences between the current and retrieved situations. This
computation is performed by SME, where Δ is defined to be those aspects that failed
to be placed in correspondence, that is

Δ = [T_old - M] ∪ [T_new - M]

where M represents the analogical mapping produced by SME for the base and target
descriptions T_old and T_new.

3. Use the domain theory to predict the behavioral changes Δ could cause.

4. Favor revision hypotheses concerning quantities affected by Δ over revisions
concerning unaffected quantities.
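
Stage 2 amounts to collecting whatever the mapping left unmatched on either side. The sketch below treats each situation description as a set of fact strings and the mapping as a dictionary from base facts to target facts; this representation and all fact names in the usage note are simplifying assumptions, not SME's actual data structures.

```python
def situation_differences(t_old, t_new, mapping):
    """Sketch of the difference set: facts of either description that were
    not placed in correspondence by the analogical mapping.

    t_old, t_new: iterables of fact strings for the retrieved (base) and
                  current (target) situations.
    mapping:      dict from matched base facts to matched target facts.
    """
    unmatched_old = set(t_old) - set(mapping.keys())
    unmatched_new = set(t_new) - set(mapping.values())
    return unmatched_old | unmatched_new
```

For two hypothetical descriptions that differ only in `Open(c1)` versus `Closed(c2)`, the result is exactly those two facts.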

The only component of this process not described previously is the third - predict the
behavioral changes Δ could cause. In general, this is a very hard problem. The most
advanced work on this topic is Weld's (1988) comparative analysis, a technique for
determining the effects of qualitative changes to a system's continuous parameters. However, it
is not applicable to structural modifications, additions, or deletions. In PHINEAS, a
simple mechanism for achieving the desired effect is available: if the domain theory has the
ability to predict behavioral changes caused by Δ, these changes will appear as differences
in the theory's predictions for the two situations.* Hence, rather than explicitly
examining all possible ramifications of Δ, the quantities empirically relevant to revision are those
for which different behavior was predicted. This requires reexplaining the prior situation
(using QPE) and comparing that explanation to the current anomalous prediction.

Note that DBR is neutral with respect to revision hypotheses involving quantities for 
which the domain theory makes no prediction. Such hypotheses may be produced by the 
precedent-guided analysis, which is capable of introducing new quantities. 

* This only notes changes such as constant to increasing. It does not indicate changes in degree, such as
an increased rate. Weld's comparative analysis would be needed to detect such differences.

7.4.1 Closed Container Example

PROCESS-3318, which fails to predict the behavior of alcohol in a closed container, was
originally developed to explain an observation of alcohol in an open container. Consider
the predicted behavior for the first state in each of the two scenarios:

Derivative                    Container Open   Container Closed
Ds[Amount-of(alcohol1)]            -1                -1
Ds[Amount-of(sk-steam1-1)]          1                 1
Ds[Pressure(alcohol1)]             -1                -1
Ds[Pressure(sk-steam1-1)]           0                 1
Ds[Temperature(alcohol1)]          -1                -1
Ds[Heat(alcohol1)]                 -1                -1

There is only one change in the model's predictions - in an open container, the gas 
pressure remains constant due to the infinite capacity of the atmosphere. Since closing the 
container caused the steam's pressure to rise where it had not before, revisions based on 
the steam's pressure have empirical justification. There is only one: 

QC(Less-than[Pressure(sk-steam1-1), limit(sk-steam1-1)])
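
The comparison in this example can be reproduced with two dictionaries of predicted derivative signs. The sketch below assumes the values in the table above and uses a simple substring test to decide whether a revision mentions an affected quantity; both function names are hypothetical.

```python
# Predicted derivative signs for the first state of each scenario
# (-1 decreasing, 0 constant, 1 increasing), taken from the table above.
OPEN = {
    "Amount-of(alcohol1)": -1, "Amount-of(sk-steam1-1)": 1,
    "Pressure(alcohol1)": -1, "Pressure(sk-steam1-1)": 0,
    "Temperature(alcohol1)": -1, "Heat(alcohol1)": -1,
}
CLOSED = dict(OPEN, **{"Pressure(sk-steam1-1)": 1})


def affected_quantities(pred_old, pred_new):
    """Quantities predicted in both situations whose predicted sign differs.
    Quantities lacking a prediction in either situation are skipped, which
    keeps DBR neutral about them."""
    return [q for q in pred_new if q in pred_old and pred_old[q] != pred_new[q]]


def favored_revisions(revisions, affected):
    """Revisions that mention at least one affected quantity."""
    return [r for r in revisions if any(q in r for q in affected)]
```

Applied to the seven first-principles proposals, only the condition on Pressure(sk-steam1-1) is favored.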

Had the model possessed knowledge of alcohol vapor concentration (as it learns from 
the dissolving analogy), the following revision would be proposed as well: 

QC(Less-than[Amount-of(sk-steam1-1), Saturation-Point(sk-steam1-1)])

7.5 Closed Container Example: Denouement 

When the three revision techniques are combined, the following additions to PROCESS-3318
are proposed:

Proposed Addition                                                       P-G   DBR
QC(Greater-than[Amount-of(alcohol1), limit(alcohol1)])
QC(Less-than[Amount-of(sk-steam1-1), Saturation-Point(sk-steam1-1)])     x
QC(Greater-than[Amount-of(alcohol1), Amount-of(sk-steam1-1)])
QC(Greater-than[Pressure(alcohol1), limit(alcohol1)])
QC(Less-than[Pressure(sk-steam1-1), limit(sk-steam1-1)])                       x
QC(Greater-than[Temperature(alcohol1), limit(alcohol1)])
QC(Greater-than[Heat(alcohol1), limit(alcohol1)])

Thus, according to the evidence, one of the following proposed additions to PROCESS-3318
should be selected:

QC(Less-than[Pressure(sk-steam1-1), limit(sk-steam1-1)])
QC(Less-than[Amount-of(sk-steam1-1), Saturation-Point(sk-steam1-1)])

Since the former requires postulating an unknown quantity, the second should be chosen
as the best revision to make. Notice that because saturation was not a concept present in
the model under revision, DBR was unable to predict how it would be affected by a closed
container. Thus, DBR neither supports nor discounts the second hypothesis.

7.6 Perspective 

This chapter has proposed that relative likelihood measures for ordering potential revision 
hypotheses arise from experience, both with behavior analogous to a given anomaly and
with prior applications of the theory under investigation. At this time, it remains simply a 
proposal. The first-principles rules have been fully implemented. However, the installation 
of precedent-guided revision and DBR has only recently begun. 

The analogy approach described above is designed to provide focus to an otherwise
unguided first-principles technique. An alternative approach to this problem is
experimentation-based theory revision (Rajamoney et al., 1985; Rajamoney, 1988a), which prunes
inconsistent revision hypotheses through directed experimentation.* By splitting the set of
possible revisions through discrimination experiments, Rajamoney's ADEPT system is able to
isolate the appropriate change to a flawed theory. However, it has the potential to
sanction many experiments, often on hypotheses that look silly to a human observer, because
it lacks experiential knowledge indicating what is likely. This was the motivation
behind combining experimentation-based theory revision with analogical hypothesis
generation (Falkenhainer & Rajamoney, 1988). These two approaches to theory development are
complementary. ADEPT was provided PHINEAS' analogical mechanism to focus the revision
process. At the same time, PHINEAS was given the ability to interact and ask questions of
the world through ADEPT.

For example, the union is sometimes crucial for analyzing the coverage of a model. An 
analogy will often predict additional, unobserved behavior. Did this additional behavior 
actually happen, or does the prediction represent a flaw in the model? Some of these 
predictions may be refuted logically, by relation to what is already known. Others, however, 
must be empirically tested by repeating the scenario if possible and specifically looking for 
the predicted properties. 

In one of the implemented examples, the loss of alcohol sitting in an open container
is explained as being analogous to the vaporization portion of boiling. When the new
evaporation model is used to anticipate what should happen in the given situation, a new
secondary prediction is produced - the alcohol's temperature must have dropped, due to
the loss of latent heat during vaporization. Since the alcohol's temperature behavior was
not originally reported, ADEPT called for the physical scenario to be repeated and changes
in alcohol temperature noted. The test confirmed the evaporation theory's hypothesis.

* An in-depth study of the revision problem may be found in (Rajamoney, 1988b). There he presents
two techniques not used in PHINEAS, active experimentation and exemplar-based theory rejection.




Chapter 8 

The PHINEAS System 



PHINEAS is an explanation system which uses analogy as the primary source of hypothesis
generation, rather than one of the more conventional abductive, unification-based methods
(e.g., Charniak, 1972; DeJong, 1982; Forbus, 1986a; Josephson et al., 1987; Mooney, 1987;
Pople, 1973; Reggia, 1983; Simmons, 1988). In preceding chapters, the individual stages
embodied in PHINEAS were presented sequentially, showing how a complete and consistent
explanation of a given observation is developed. This chapter discusses the process from a
global perspective. It begins by reviewing PHINEAS in terms of the programs comprising it
and describes how they interact. The chapter then discusses the criteria used to prefer one
hypothesis over another. This is a primary determinant of how PHINEAS' flow of control
moves from one stage to another and from one working hypothesis to another. Finally,
the disappearing alcohol example is presented in its entirety to show how the dissolving
analogue that has been described in preceding chapters is but one of several considered.



8.1 Program components 

The VBAL approach to analogical learning and reasoning is knowledge-intensive. It re- 
quires the ability to make analogical comparisons, perform deductive, abductive, and qual- 
itative reasoning, and analyze the completeness of a theory with respect to the observations 
it should explain. In support of these tasks, PHINEAS uses three auxiliary modules:

SME: The structure-mapping engine functions as the system's mapping module,
identifying similarity and proposing candidate inferences to posit explanations.






QPE: Forbus' (1986b) qualitative process engine is used to envision the scenario predictions
of a hypothesized model.*

DATMI: Decoste's (1989) dynamic across-time measurement interpretation program serves 
to relate observations of physical behavior to the predictions of a qualitative theory. 

One of the goals in the construction of PHINEAS was to apply the model of analogy
developed in this thesis to a nontrivial reasoning task. To that end, PHINEAS embodies
representation and analysis techniques from the state of the art in qualitative physics in
order to maximize scale and generality. It is a large program, consisting of over 6,000 lines
of Common Lisp code and 435 functions.** When combined with the other program modules
making up the complete system (i.e., SME, QPE, and DATMI), this rises to 39,260 lines of Lisp
code and 3,000 functions.

A block diagram of PHINEAS showing the five primary stages of operation and their
interaction with the various program modules appears in Figure 8.1. These five stages 
correspond to the preceding four chapters of this thesis: 

1. Behavior match. A new observation triggers a search for previously understood 
experiences that exhibited analogous behavior. Abstractions of the observed situation 
and its behavior are used to focus attention on a potentially relevant subset of memory. 
Each experience in this subset is then compared at a detailed level to the current 
situation, using SME as the comparative mechanism.

2. Theory generation. The central objective of the second stage is to produce a fully
operational initial hypothesis about the current domain. This has two components.

(a) Mapping. First, the models used to explain analogous aspects of the recalled
experience are retrieved and SME is used to analogically map these into the
current domain. This mapping is guided by the initial correspondences found in
the behavioral comparison.

(b) Transfer. Second, to operationalize the model, the consistency of its expressions
must be ensured and any unknown skolem objects it requires must be inferred
from the domain theory or their existence postulated. A map and analyze cycle
may ensue.

* QPE, like PHINEAS, is actually a system of programs consisting of the QPE code itself, de Kleer's ATMS,
and Forbus & de Kleer's rule-based problem solver for the ATMS, called ATMoSphere.

** The number of lines represents just lines of code. Blank lines and comment lines are not included.

Figure 8.1: Block diagram of the PHINEAS system modules.

3. Gedanken analysis. The operational model is used to construct an explanation of
the present observation. The model is given to QPE, which generates an envisionment
for the model applied to the observed physical configuration. DATMI then compares
these predictions to the observation and either determines that the model is adequate
or identifies points of discrepancy.

4. Revision. If an initial hypothesis fails, or an old hypothesis is inadequate for a
new situation, an attempt should be made to adapt it around points of inaccuracy.
A model of revision is advocated which relies on past experiences to guide the
formation and selection of revision hypotheses. It considers behavior analogous to the
current anomaly and considers how the current anomalous situation differs from prior
situations that were consistently explained. This is the only component that is not
fully implemented.

While described as a sequential process concerned with the development of a single
explanatory hypothesis, the program's focus may change from one hypothesis to another.
At some point, the cost of additional work on a poor hypothesis is outweighed by the
potential of other hypotheses needing further development. A prerequisite to altering focus
is the ability to evaluate a working hypothesis and prefer one candidate over another. These
issues are the topics of the next two sections.

8.2 Preference Criteria 

PHINEAS is primarily concerned with the interpretation-construction task of explanation:
find candidate explanations and the assumptions on which they rest. However, a system 
that exhaustively generated an unordered set of possible hypotheses would not be of much 
use. It should focus on the most promising explanations first and provide a preferential 
ordering on fully developed hypotheses. 

Two general types of preference criteria are used in PHINEAS. During the early stages
of hypothesis development, the only preferential guidance available is the degree of
similarity between a candidate analogue and the current situation. This is represented by
SME's evaluation score for the behavioral and structural match between the two situations
computed during access. When determining which of two candidate analogues to consider
next, the one with the higher similarity score is chosen. This metric supports the similarity
conjecture stated in Chapter 1, which proposes that interpretation-construction tasks may
be characterized as the search for maximal, explanatory similarity between the situation
being explained and some explainable scenario.

Once an actual hypothesis has been formed (i.e., the result of transfer), the preference
criterion must change to consider the characteristics of the hypothesis itself. A complete
account of theory selection requires consideration of many complex factors, such as a
theory's plausibility, coherence, effect on prior beliefs, simplicity, and specificity in accounting
for the phenomenon. Unfortunately, these are significant open research problems in their
own right, and certainly beyond the scope of this thesis. However, a number of important,
more specific preference criteria are readily available and have been found useful in PHINEAS
for establishing preference between competing hypotheses. These are:

C_CE Conjectured entities. Does the hypothesis conjecture the existence of a novel kind of
entity, and if so, how many?

C_VE Vocabulary extensions. Does the hypothesis require the creation of new predicates,
and if so, how many?

C_CA Composite assumptions. Does the hypothesis conjecture the existence of new physical
processes or new knowledge structures (e.g., schemas, etc.), and if so, how many?

C_AE Assumed entities. Does the hypothesis assume the presence of a known type of entity
not mentioned in the original scenario description, and if so, how many?

C_AA Atomic assumptions. Does the hypothesis make additional assumptions about the
properties and interrelationships of objects in the scenario, and if so, how many?

The single preference criterion used to evaluate a hypothesis or compare two competing 
hypotheses is a function of these five. The method for combining them is adapted from 
Michalski (1983), who describes the use of a lexicographic evaluation functional (LEF) for 
evaluating alternate inductive concept descriptions. A LEF is a list of elementary criterion- 
tolerance pairs, in which each elementary criterion is applied sequentially to prune the space 
of hypotheses. In PHINEAS, the elementary preference criteria are ordered according to an
approximate measure of decreasing "cost":

LEF = (C_CE, C_VE, C_CA, C_AE, C_AA)

Thus, an explanation which postulates the existence of a novel kind of entity (C_CE) is at
all times deemed inferior to one which does not. Each criterion returns a number (N ≥
0) as described above, where a value of zero indicates success and a value greater than
zero indicates failure. The LEF is used to select the most preferable explanation(s) from
a given set as follows: First, each proposed explanation is evaluated by criterion C_CE and
those that pass C_CE are retained. The process is repeated with the next criterion on the
set of retained hypotheses until only a single hypothesis remains or the list of criteria is
exhausted. If at any point all hypotheses evaluated by a particular criterion fail, the process
stops and the current set is returned in increasing order according to their score N for that
criterion.
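
The selection procedure just described can be sketched in a few lines. This is an illustration under simplifying assumptions, not PHINEAS' code: hypotheses are represented as dictionaries of criterion counts, the criterion keys "CE" through "AA" stand for the five criteria in LEF order, and the tolerances of Michalski's original formulation are omitted.

```python
CRITERIA = ("CE", "VE", "CA", "AE", "AA")  # decreasing "cost" order


def lef_select(hypotheses, criteria=CRITERIA):
    """Sketch of LEF selection.

    hypotheses: dict mapping hypothesis name -> dict of criterion -> count N,
                where N == 0 means the hypothesis passes that criterion.
    Returns the preferred hypothesis names, best first.
    """
    current = list(hypotheses)
    for criterion in criteria:
        if len(current) <= 1:
            break  # a single hypothesis remains
        passing = [h for h in current if hypotheses[h][criterion] == 0]
        if not passing:
            # Everyone fails here: stop and rank by this criterion's score.
            return sorted(current, key=lambda h: hypotheses[h][criterion])
        current = passing
    return current
```

Because pruning is sequential, a hypothesis that fails an early criterion is discarded whenever some competitor passes it, regardless of how well it scores later.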

This evaluative function produces an interesting property when viewed from the per- 
spective of the four explanation scenarios described in Chapter 1: 

1. Deductive scenario. Given phenomenon V, where V represents a set of observables,
a complete explanation of V deductively follows from existing knowledge. This
corresponds to explanations passing every criterion.

2. Assumption scenario. No explanation can be grounded with current knowledge
because not all of the relevant facts are known. However, a complete explanation follows
from the union of existing knowledge and a consistent set of assumptions about the
missing facts. This corresponds to explanations passing every criterion but one of the
last two, C_AE and C_AA.

3. Generalization scenario. Existing knowledge indicates that candidate explanation
E cannot apply because condition Ci is known to be false in the current situation.
However, E does follow if condition Ci is replaced by the next most general relation,
since Ci's sibling is true in the current situation. This corresponds to explanations
passing the first two criteria, C_CE and C_VE, but failing C_CA, in which a knowledge
structure is viewed as "new" if it represents a modification of an existing knowledge
structure.*

4. Analogy scenario. No candidate explanation is available directly, but an explanation
is available if a series of analogical assumptions are made, that is, if the situation
it explains is assumed analogous to the current situation. This corresponds to
explanations failing one of the first three criteria, C_CE, C_VE, or C_CA.

The evaluative function causes PHINEAS to propose standard, deductive explanations if
found. In their absence, conventional abductive explanations will be preferred. If existing
theories are insufficient to provide an explanation, explanations adapting knowledge of
potentially analogous phenomena will be offered. By using analogy as the single source
for explanation generation, PHINEAS is able to offer a "best guess" in the presence of an
imperfect or incomplete domain theory.

8.3 Flow of Control 

In addition to theory selection, preference criteria are important for guiding PHINEAS toward
developing the most promising hypotheses first. PHINEAS' global operation is controlled by
a task agenda, which maintains an ordered sequence of task-hypothesis pairs. Multiple
hypotheses in various stages of development may exist at any one time. The task agenda
ordering determines which hypothesis to expend effort on next, enabling the program's
focus to change from one hypothesis to another. Repeatedly, the task-hypothesis pair at
the front of the agenda is selected and executed, resulting in further development of its
corresponding hypothesis. This task may in turn spawn other tasks, modify tasks waiting
for execution, or signal the acceptance of a hypothesis, which halts the cycle.

* The issue of whether to actually create a new knowledge structure or modify the existing one is an
important but orthogonal issue. Here we are concerned with hypothesis evaluation rather than storage of
an accepted hypothesis.

Figure 8.2: PHINEAS task scheduling.

Each task is given a priority level, giving rise to the priority lattice shown in Figure 8.2.
Lower priority numbers indicate increasing precedence. In addition to their normal priority
levels, the mapping and transfer tasks have an auxiliary score for sorting tasks within the
same priority level. This auxiliary score is SME's evaluation score for the behavioral and
structural match between the current observation and the task's associated analogue.
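
An agenda with this ordering can be sketched with a binary heap keyed on the priority level and, within a level, on the negated match score so that better matches come first. The class below is an illustrative assumption rather than PHINEAS' data structure; apart from the simulate priority of 1 given in the text, the priority numbers in the usage note are made up.

```python
import heapq
import itertools


class TaskAgenda:
    """Sketch of a task agenda: lower priority numbers run first; within a
    priority level, tasks with a higher auxiliary score run first."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # tie-breaker keeps insertion order

    def schedule(self, task, hypothesis, priority, score=0.0):
        entry = (priority, -score, next(self._counter), task, hypothesis)
        heapq.heappush(self._heap, entry)

    def pop(self):
        """Remove and return the (task, hypothesis) pair at the front."""
        _, _, _, task, hypothesis = heapq.heappop(self._heap)
        return task, hypothesis

    def __len__(self):
        return len(self._heap)
```

For example, after scheduling two mapping tasks at a hypothetical level 3 with scores 2.0 and 9.5 and one simulate task at level 1, the simulate task is popped first, followed by the mapping task scored 9.5.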

There are eight task types currently used in PHINEAS: access, mapping,
form-unique-mappings, transfer, simulate, simulate-poor, revise, and decision-pool.
The function of most of these should be evident from their name. For example, the 
transfer task may be paraphrased as: 

Task TRANSFER: Given candidate inference CI,

    For each theory T ∈ Transfer(CI)
        If Poor-hypothesis?(T)
            then Schedule(Simulate-Poor(T), 5)
            else Schedule(Simulate(T), 1)

The transfer task produces a set of operational theories from a given candidate
inference and then schedules each of those theories for simulation. If transfer produces a
poor hypothesis, the subsequent simulate-poor task is given a priority of 5. Otherwise,
the simulate task is given a priority of 1. A poor hypothesis is defined as one which
contains conjectured entities (C_CE) or requires the creation of new predicates (C_VE). The
two simulate tasks are identical except that simulate-poor additionally reschedules all
other simulate-poor tasks waiting on the agenda to be normal simulate tasks. Due to
the priority arrangement, the execution of a simulate-poor task indicates that there are
no "good" hypotheses available. In such a situation, all "poor" hypotheses are reclassified
as "good".
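
The scheduling decision and the reclassification step can be sketched as two small functions over (task, hypothesis, priority) triples. The representation of a theory as a dictionary with "CE" and "VE" counts is an assumption made for illustration, as are the function names.

```python
def is_poor(theory):
    """A poor hypothesis conjectures entities or needs new predicates."""
    return theory["CE"] > 0 or theory["VE"] > 0


def transfer_task(theories):
    """Sketch of the transfer task: schedule each operational theory for
    simulation, at priority 5 if poor and priority 1 otherwise."""
    return [
        ("simulate-poor", t["name"], 5) if is_poor(t) else ("simulate", t["name"], 1)
        for t in theories
    ]


def promote_poor(agenda):
    """Sketch of the extra step in simulate-poor: every waiting simulate-poor
    task is rescheduled as a normal simulate task; other tasks are untouched."""
    return [
        ("simulate", name, 1) if task == "simulate-poor" else (task, name, priority)
        for task, name, priority in agenda
    ]
```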

form-unique-mappings is invoked when a behavioral comparison is ambiguous,
producing multiple correspondence sets. At times, multiple behavioral matches support the
same candidate inference. Hence, mapping is applied to each and their resulting candidate
inferences compared, with duplications removed.

decision-pool is a decision-making task applied to hypotheses found to be adequate
during the simulation phase. Its priority level is set such that when a decision-pool task
reaches the front of the agenda, all hypotheses to be considered have completed at least the
transfer phase of development. The decision-pool task collects all adequate explanations
existing on the task agenda and applies the LEF described above to this set. The best
explanation(s) according to the LEF are returned as PHINEAS' proposed explanation(s) of
the observed phenomenon and the cycle halts.

This priority setting for decision-pool is chosen primarily to enable observation of
PHINEAS' operation across the entire space of possibilities. It ensures that all of the initial
analogues proposed by access complete at least the transfer phase. This results in the
extra work of developing all these hypotheses, but enables application of the LEF to a
well-developed candidate set. If a more best-first approach is desired, then the priority
of decision-pool may be set to 0, causing immediate acceptance of the first adequate
hypothesis developed.

8.4 Disappearing Alcohol Observation: Reprise 

The disappearing alcohol example described at various points in preceding chapters will
now be presented in its entirety, showing the different hypotheses PHINEAS considers, how
it moves from one hypothesis to another, and how it makes its final selection(s).

The explanation process begins with a statement of the observation, which is given in its
entirety in Figure 8.3. At this point in the thesis, the reader has probably assumed that the
observation corresponds to evaporation. There are two important items to consider about
the observation and PHINEAS' approach to it. First, evaporation is only one of the possible
explanations consistent with the given information. Second, PHINEAS does not possess
knowledge about evaporation, so it must examine the situation from that perspective. At
this point, PHINEAS' task queue contains:^

^Here I am showing the task forms exactly as they appear when printed by PHINEAS. The first number is
Decreasing[Amount-of(alcohol1)]
Greater-than(A(Amount-of(alcohol1)), zero)
Constant[Change-Rate(Amount)]

Substance(alcohol1)
Contained-Liquid(alcohol1)
Container(beaker2)
Container-of(alcohol1, beaker2)
Substance-of(alcohol1, alcohol)
Open(beaker)

Constant[Amount-of(alcohol1)]
Equal-to(A(Amount-of(alcohol1)), zero)

Figure 8.3: The "disappearing alcohol" scenario description and its corresponding observed
behavior. The amount of alcohol in an open beaker is decreasing at a constant rate. This
activity stops when the amount of alcohol equals zero.



(7 0.0 (ACCESS OBS#OPEN-ALCOHOL))

The access task operates in two steps (Chapter 4). First, the behavioral abstractions
of the observation are used to activate prior experiences. Second, SME is used to provide a
closer examination of the N (15) most activated experiences, resulting in their rank ordering
and an initial set of correspondences between the current observation and each retrieved
scenario. The four possibilities discovered and their SME scores are shown in Figure 8.4. At
this point, PHINEAS' task queue contains:

(4 16.73 (MAPPING MAP(BSeg#BOILING-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 14.90 (MAPPING MAP(BSeg#LIQUID-DRAINING-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 12.62 (MAPPING MAP(BSeg#DISSOLVE-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 11.06 (MAPPING MAP(BSeg#2-CONTAINER-LF,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
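The two-step access procedure and the resulting queue order can be sketched as follows. This is a minimal sketch under stated assumptions: the abstraction label "decrease-to-zero", the "spring-block" memory item, and the `access` function itself are hypothetical, and the matcher is replaced by a lookup table holding the four scores reported in the queue listing above.

```python
def access(observation_abstractions, memory, sme_score, n=15, priority=4):
    """Activate experiences by shared abstractions, then rank the top n by match score."""
    def shared(exp):
        return len(exp["abstractions"] & observation_abstractions)

    # Step 1: keep experiences sharing at least one abstraction, most activated first.
    activated = sorted((e for e in memory if shared(e) > 0), key=shared, reverse=True)
    # Step 2: score the n most activated experiences and rank them best-first.
    ranked = sorted(((sme_score(e), e["name"]) for e in activated[:n]), reverse=True)
    return [(priority, score, ("MAPPING", name)) for score, name in ranked]


# Hypothetical memory contents; only the scores come from the text.
memory = [
    {"name": "2-container-lf", "abstractions": {"decrease-to-zero"}},
    {"name": "boiling", "abstractions": {"decrease-to-zero"}},
    {"name": "spring-block", "abstractions": {"oscillation"}},
    {"name": "dissolve", "abstractions": {"decrease-to-zero"}},
    {"name": "liquid-draining", "abstractions": {"decrease-to-zero"}},
]
scores = {"boiling": 16.73, "liquid-draining": 14.90, "dissolve": 12.62, "2-container-lf": 11.06}
queue = access({"decrease-to-zero"}, memory, lambda e: scores[e["name"]])
```

Each returned tuple mirrors the printed task form: priority level, auxiliary score, then the task itself.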

In executing the first mapping task, PHINEAS is given the behavioral mapping comparing
the current open-alcohol-behavior observation to the recalled boiling-behavior
experience. This experience describes a pan of water that is boiling on a stove. The first
step in mapping it to the current situation is retrieving the domain theory used to explain

the priority level (one represents the highest possible priority), the second number is the auxiliary
score for sorting tasks within the same priority level. OBS#OPEN-ALCOHOL indicates the presence of a
LISP structure holding the data for the observation "open-alcohol".
Dissolving salt
Leaky cup
2 container liquid flow

Figure 8.4: Candidate analogues for the "disappearing alcohol" scenario returned by the
access task.



analogous aspects of its behavior. In this case, there were three processes used in the prior
boiling explanation: boiling, heat flow, and heat replenish. The boiling process specifies the
vaporization of a liquid when at or above its boiling temperature at a rate qualitatively
equal to the rate of heat flow into the liquid. The heat replenish process is a common
technique for modelling the behavior of an ideal heat source in QP theory by resupplying it
with heat as fast as it is depleted. Where this added heat is coming from is not considered.

The second step in mapping these processes to the current situation is to declare
the correspondences sanctioned by access. These are (pan7 ↔ beaker2), (bwater1
↔ alcohol1), (water ↔ alcohol), (amount-of ↔ amount-of), and (change-rate ↔
change-rate).

The third mapping step invokes SME, which returns the gmap shown in Figure 8.5. The
initial explanation for the disappearing alcohol, via analogy to boiling water, appears in the
candidate inferences field. It proposes that something like boiling is removing alcohol1
from beaker2. However, this mapping and its candidate inference must be analyzed. At
this point, PHINEAS' task queue contains:



(3 16.73 (TRANSFER MAP(BSeg#BOILING-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 14.90 (MAPPING MAP(BSeg#LIQUID-DRAINING-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 12.62 (MAPPING MAP(BSeg#DISSOLVE-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 11.06 (MAPPING MAP(BSeg#2-CONTAINER-LF,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
Figure 8.5: SMEcsm mapping from the boiling processes (boiling, heat flow and heat
replenish) to the disappearing alcohol scenario.
When transfer is attempted on the boiling hypothesis, all of the expressions are
consistent, but two skolem objects are found: (:skolem stove9), the ideal heat source, and
(:skolem bsteam1), the vapor produced from boiling. When the abductive retriever is used
to seek possible analogues for these objects in the current situation, no candidates are found.
Therefore, PHINEAS assumes a new entity token for each unknown object and confirms that
the candidate inference remains consistent upon their installation. The transfer operation
is completed by the explicit assumption of the now operational hypothesis. PHINEAS first
determines that the three proposed processes are identical to the original boiling, heat flow
and heat replenish processes (they were never changed through mapping and transfer) and
removes them from the hypothesis. It then makes the following set of atomic assumptions:

(THERMAL-OBJECT SK-STOVE9-1)
(HEAT-SOURCE SK-STOVE9-1)
(CONTAINED-GAS SK-BSTEAM1-1)
(CONTAINER-OF SK-BSTEAM1-1 BEAKER2)
(SUBSTANCE-OF SK-BSTEAM1-1 ALCOHOL)
(HEAT-CONNECTION BEAKER2 SK-STOVE9-1 ALCOHOL1)
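The way unresolved skolem objects turn into assumed entity tokens can be sketched as below. This is a hedged illustration only: `resolve_skolems` and `find_analogue` are hypothetical names, and the `SK-<NAME>-1` token format simply imitates the tokens printed in the assumption list above, without claiming to know how PHINEAS numbers them.

```python
def resolve_skolems(skolems, find_analogue):
    """Bind each skolem to a retrieved analogue, or to a fresh assumed entity token."""
    bindings = {}
    for name in skolems:
        analogue = find_analogue(name)  # stands in for the abductive retriever
        if analogue is None:
            # No analogue in the current situation: assume a new entity token.
            analogue = f"SK-{name.upper()}-1"
        bindings[name] = analogue
    return bindings


# Boiling hypothesis: the retriever finds no candidates for either object.
boiling_bindings = resolve_skolems(["stove9", "bsteam1"], lambda name: None)
```

Passing a retriever that does return a candidate leaves that skolem bound to an existing object rather than a new token.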

The transfer task then schedules a simulate task for the hypothesis. Since the hypothesis
does not contain conjectured entities (Cce) or create new predicates (Cvs), simulate is
scheduled rather than simulate-poor. At this point, PHINEAS' task queue contains:^

(2 16.73 (SIMULATE TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1))
(4 14.90 (MAPPING MAP(BSeg#LIQUID-DRAINING-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 12.62 (MAPPING MAP(BSeg#DISSOLVE-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 11.06 (MAPPING MAP(BSeg#2-CONTAINER-LF,BSeg#OPEN-ALCOHOL-BEHAVIOR)))

The simulate task invokes QPE on the given hypothesis to envision its predictions for
the current scenario. This is shown in Figure 8.6. State S2 describes the alcohol below
the boiling point and heating up. State S1 describes the alcohol boiling, with its amount
decreasing at a constant rate and the amount of hypothesized alcohol steam increasing.
Both S2 and S1 are able to transition to state S0. QPE individuates states in an envisionment
by the derivatives of quantities and the active processes. For this scenario, there are two
distinct situations having no active processes and derivatives equal to zero. In situation
S0-1, the alcohol has completely boiled away, leaving nothing more to boil. This is the
conclusion observed in the disappearing alcohol scenario. Alternatively, S0-2 describes the
alcohol and stove temperatures equalizing, again leading to a halt in active behavior. When

^The current working theory, TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1, is the result of transferring
TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1, the original candidate inference derived from mapping.
Processes:

PI0: HEAT-FLOW(SK-STOVE9-1, ALCOHOL1, BEAKER2)
PI1: HEAT-REPLENISH(SK-STOVE9-1, PI0)
PIS: BOILING(ALCOHOL, BEAKER2, ALCOHOL1, SK-BSTEAM1-1, PI0)

Figure 8.6: Envisionment of the boiling hypothesis for the disappearing alcohol observation.
Since QPE distinguishes states by the derivatives of quantities and active processes, state
S0 is actually two different situations (S0-1 and S0-2), but both have the same derivative
and process characteristics. Ds[...] denotes the sign of the derivative. One table symbol
indicates that a quantity is undefined during that state (e.g., its process doesn't exist) and
another indicates that any value is possible.
DATMI is used to determine adequacy, it finds that the envisionment is both complete and
consistent with respect to the observed behavior. At this point, PHINEAS has constructed
an adequate explanation of the situation. However, due to the task settings in use, it
continues to examine other candidates. Its task queue now contains:

(4 14.90 (MAPPING MAP(BSeg#LIQUID-DRAINING-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 12.62 (MAPPING MAP(BSeg#DISSOLVE-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 11.06 (MAPPING MAP(BSeg#2-CONTAINER-LF,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(5 0.00 (DECISION-POOL TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1))
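The individuation rule described for the boiling envisionment, where states are distinguished only by derivative signs and active processes, can be sketched as a grouping step. This is an illustrative sketch rather than QPE's actual algorithm: the quantity names and the derivative signs assigned to S1 and S2 are assumptions made for the example, and only the collapse of S0-1 and S0-2 into one state is taken from the text.

```python
def individuate(situations):
    """Group situations that share derivative signs and active processes."""
    states = {}
    for name, info in situations.items():
        key = (tuple(sorted(info["derivs"].items())), frozenset(info["active"]))
        states.setdefault(key, []).append(name)
    return list(states.values())


# Hypothetical encoding of the boiling envisionment (signs: -1, 0, +1).
situations = {
    "S2": {"derivs": {"amount": 0, "temperature": 1},
           "active": {"heat-flow", "heat-replenish"}},
    "S1": {"derivs": {"amount": -1, "temperature": 0},
           "active": {"heat-flow", "heat-replenish", "boiling"}},
    "S0-1": {"derivs": {"amount": 0, "temperature": 0}, "active": set()},  # alcohol gone
    "S0-2": {"derivs": {"amount": 0, "temperature": 0}, "active": set()},  # temperatures equal
}
groups = individuate(situations)
```

Because S0-1 and S0-2 have identical keys, they land in the same group even though they describe physically different endings.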

The second candidate analogue proposed during access is a leaky cup: perhaps the
alcohol is leaking through a hole in the beaker. When the leaky cup's explanation is
mapped, two processes are proposed, corresponding to liquid flow and liquid drain (an ideal
sink, the opposite of the heat replenish process described above). The transfer task applied to this
hypothesis finds that the individual expressions are consistent, but the candidate inference
contains three unknown objects: (:skolem sink5), the destination of flow, (:skolem
cs-sink5), the destination liquid, and (:skolem hole7), the fluid path leading out of the
beaker. In each case, no analogue can be found and a new entity token is assumed. Once
again the proposed processes are found to be identical to their original base analogues and
the final operational hypothesis consists of the following set of assumptions:

(LIQUID-SINK SK-SINK5-1)
(CAN-CONTAIN SK-SINK5-1 ALCOHOL)
(CONTAINED-LIQUID SK-CS-SINK5-1)
(CONTAINED-FLUID SK-CS-SINK5-1 ALCOHOL SK-SINK5-1)
(CONTAINER-OF SK-CS-SINK5-1 SK-SINK5-1)
(PHYSICAL-PATH BEAKER2 SK-SINK5-1 SK-HOLE7-1)
(FLUID-PATH SK-HOLE7-1)
(FLUID-ALIGNED SK-HOLE7-1)

PHINEAS now executes the simulate task on the leak hypothesis. Its envisionment is
shown in Figure 8.7. State S1 indicates that alcohol is flowing from the beaker to the
ideal sink, sk-sink5-1. The alcohol's amount is decreasing, the pressure in the beaker
is decreasing (taken at the beaker's bottom), and the flow rate is decreasing. State S0
indicates that the amount-of alcohol is equal to zero, all quantities are constant, and there
are no processes active.

When the hypothesis is compared to the original observation, an anomaly is found. The
alcohol's rate of change was constant in the observation, yet the proposed model predicts
that the change rate increases (becomes less negative) as the amount of alcohol decreases.
Consequently, PHINEAS schedules the leak hypothesis for revision. The task queue now
contains:
Processes:

PI0: LIQUID-FLOW(ALCOHOL,BEAKER2,ALCOHOL1,SK-SINK5-1,SK-CS-SINK5-1,SK-HOLE7-1)
PI1: LIQUID-DRAIN(SK-SINK5-1,SK-CS-SINK5-1,PI0)

Figure 8.7: QPE envisionment of the "leaky container" hypothesis for the disappearing
alcohol observation.
(4 12.62 (MAPPING MAP(BSeg#DISSOLVE-BEHAVIOR,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(4 11.06 (MAPPING MAP(BSeg#2-CONTAINER-LF,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(5 0.00 (DECISION-POOL TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1))
(8 14.90 (REVISE TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-2-1
            ((NO-INTERPRET ALCOHOL-GOING) (OCCURSAT QSTATE-0 ALCOHOL-DRY))
            (ALCOHOL-GOING ALCOHOL-DRY)))
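The anomaly that sends the leak hypothesis to revision can be made concrete with a small numerical sketch. The assumptions here are loud ones: the qualitative models are replaced by two hypothetical numeric rate laws (a fixed removal rate for the boiling-like model, and a rate proportional to the remaining amount for the leak), and the step size and constants are arbitrary. PHINEAS itself reasons qualitatively and does not integrate numerically.

```python
def simulate_amount(rate_of, amount=10.0, dt=0.1, steps=50):
    """Euler-integrate d(amount)/dt = rate_of(amount); return the amount history."""
    history = [amount]
    for _ in range(steps):
        amount = max(0.0, amount + rate_of(amount) * dt)
        history.append(amount)
    return history


def change_rates(history, dt=0.1):
    """Finite-difference change rate between consecutive samples."""
    return [(later - earlier) / dt for earlier, later in zip(history, history[1:])]


def is_constant(values, tol=1e-6):
    return max(values) - min(values) <= tol


boiling = change_rates(simulate_amount(lambda a: -1.0))      # fixed heat input
leaking = change_rates(simulate_amount(lambda a: -0.5 * a))  # flow driven by remaining liquid
```

Under these assumptions the boiling-like model keeps a constant change rate, matching the observation, while the leak model's change rate climbs toward zero as the beaker empties.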

The next hypothesis considered is the dissolving analogue (salt dissolving in water),
which has been discussed throughout the preceding chapters. The mapping task produces
a candidate inference that is flawed by inconsistent predicate use (e.g., (Immersed-in
alcohol1 ?water1)) and the presence of an unknown correspondent for the water. During
the transfer task, the atmosphere is found as a potential analogue for the water and
mapping is repeated. In the second transfer pass, all object correspondences are known.
All that remains is to resolve three inconsistent expressions:

(Solution atmosphere)
(Soluble alcohol1)
(Soluble-in alcohol1 atmosphere)

which become

(Solution-8 atmosphere)
(SK-Soluble-4-1 alcohol1)
(SK-Soluble-in-4-1 alcohol1 atmosphere)

Having produced an operational hypothesis from the dissolving candidate inference, the
transfer task schedules it for simulation. Due to the new predicates required (Cvs), it is
scheduled for the simulate-poor task. The task queue now contains:

(4 11.06 (MAPPING MAP(BSeg#2-CONTAINER-LF,BSeg#OPEN-ALCOHOL-BEHAVIOR)))
(5 0.00 (DECISION-POOL TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1))
(6 12.62 (SIMULATE-POOR TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-4))
(8 14.90 (REVISE TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-2-1
            ((NO-INTERPRET ALCOHOL-GOING) (OCCURSAT QSTATE-0 ALCOHOL-DRY))
            (ALCOHOL-GOING ALCOHOL-DRY)))

The final analogue considered is a standard liquid flow behavior between two containers
connected by a pipe. It results in theory OPEN-ALCOHOL-BEHAVIOR-THEORY-5-1, which
conjectures that the alcohol is flowing to another container through a fluid path connecting
them. A description of PHINEAS invoking mapping, transfer, and simulate on this
analogue would be redundant with the prior description of the leaky cup hypothesis (including
its rejection due to a non-constant change rate). Hence, it will not be repeated here.

At this point, each candidate analogue has completed at least the transfer phase of 
operation. The task queue now contains: 
(5 0.00 (DECISION-POOL TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1))
(6 12.62 (SIMULATE-POOR TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-4))
(8 14.90 (REVISE TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-2-1
            ((NO-INTERPRET ALCOHOL-GOING) (OCCURSAT QSTATE-1 ALCOHOL-DRY))
            (ALCOHOL-GOING ALCOHOL-DRY)))
(8 11.06 (REVISE TH#OPEN-ALCOHOL-BEHAVIOR-THEORY-5-1
            ((NO-INTERPRET ALCOHOL-GOING) (OCCURSAT QSTATE-1 ALCOHOL-DRY))
            (ALCOHOL-GOING ALCOHOL-DRY)))

The priority of the decision-pool task guarantees that once a decision-pool task
reaches the front of the agenda, PHINEAS has reached the hypothesis selection point. The
first decision-pool task (boiling) collects all other hypotheses from the task agenda waiting
for decision-pool execution. In this case there are no others. It then applies the
LEF to this set. Because there is only one candidate explanation at this point, boiling is
selected as the final explanation.

As shown in Chapter 6, had the dissolving hypothesis been simulated, it too would
have produced an adequate explanation. However, it was deemed inferior to the boiling
explanation because of its creation of new predicates (Cvs). It was given a prominent
position in preceding chapters due to its demonstration of a large percentage of the main
ideas in this thesis.^

8.4.1 Discussion

Variations on this example have been used to explore PHINEAS' range of behavior with
differing amounts of information:

• With observation duration. A phenomenon's duration is often an important key to
theory development.^ In this case, it took 15 hours for the alcohol to completely
disappear. For the sake of this example, a prototypical amount of time a beaker of
alcohol takes to boil away was chosen to be the range of 30 minutes to 4 hours. When
this information is included, the boiling hypothesis is accepted with the warning

Hypothesis OPEN-ALCOHOL-BEHAVIOR-THEORY-1-1 is adequate, but fails duration bounds

At selection time, adequate candidates consistent with the observation's duration are
preferred over those having an inconsistent duration range. In this particular example,

^As a side note, the dissolving hypothesis came about as a complete surprise. When the "evaporation"
example was first attempted, I expected PHINEAS to propose boiling or liquid flow. When dissolving
appeared as a candidate analogue as well, I initially considered this a flaw since it seemed clear that
dissolving was not analogous. Further consideration reveals numerous analogous aspects between the two
situations.

^DATMI is able to use real-valued duration information to influence measurement interpretation.
boiling is the only hypothesis and is therefore accepted despite the unusually long
time it took the alcohol to vanish. At the present time, this is the only example
which takes advantage of DATMI's ability to reason about the durations of events.
The approach represents an important aspect of the theory development problem.
More work is needed to better understand its applications.

• Without change rate information. When the derivative of the alcohol's change rate
is unknown (i.e., only the alcohol's decrease is observed), the leaky container and
liquid flow hypotheses become adequate explanations. When PHINEAS reaches the
hypothesis selection point, there are three candidate explanations to choose from.
When the LEF is applied to this set, all hypotheses pass the first three criteria (Cce,
Cvs, Cca). They all fail the fourth criterion, assumed entities (Cae). The leaky cup
and two container liquid flow explanations assume three objects, while the boiling
hypothesis assumes two. Thus, boiling is selected as the best explanation.

• With temperature information. When the alcohol's temperature is included as an
observed quantity (Temperature(alcohol1) < Tboil(alcohol1)), the boiling hypothesis
is invalidated. This is reflected by transfer proposing a new process analogous
to boiling which is active for temperatures below the boiling temperature. The new
process is otherwise identical to boiling.

• With temperature, without change rate. When the change rate information is not
included, the leaky container and liquid flow hypotheses become adequate. When the
alcohol's temperature is included, the boiling analogy produces a new kind of boiling
process for temperatures below the boiling temperature. Thus, it fails the composite
assumptions (Cca) criterion. As a result, PHINEAS rejects the "boiling" explanation
and selects the leak and liquid flow hypotheses. Each is identical under the LEF, as
they both assume the existence of three entities.
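The selection outcomes in the variations above are consistent with a lexicographic comparison over the four criteria named in the text (conjectured entities, new predicates, composite assumptions, assumed entities). The sketch below assumes, for illustration only, that each criterion is a count where fewer is better and that earlier criteria dominate later ones. The function name `lef_best`, the dictionary keys, and the use of 1 to stand for "fails the composite assumptions criterion" are all hypothetical; only the assumed-entity counts (two for boiling, three for the others) come from the text.

```python
CRITERIA = ("conjectured_entities", "new_predicates", "composite_assumptions", "assumed_entities")


def lef_best(candidates):
    """Return the names of all candidates with the lexicographically smallest criteria tuple."""
    def key(candidate):
        return tuple(candidate[c] for c in CRITERIA)

    best = min(key(c) for c in candidates)
    return [c["name"] for c in candidates if key(c) == best]


def hypothesis(name, ce=0, vs=0, ca=0, ae=0):
    return {"name": name, "conjectured_entities": ce, "new_predicates": vs,
            "composite_assumptions": ca, "assumed_entities": ae}


# Without change rate information: three adequate candidates.
without_rate = [hypothesis("boiling", ae=2), hypothesis("leak", ae=3),
                hypothesis("liquid-flow", ae=3)]

# With temperature, without change rate: boiling needs a new composite process.
with_temp = [hypothesis("boiling", ca=1, ae=2), hypothesis("leak", ae=3),
             hypothesis("liquid-flow", ae=3)]
```

Returning a list rather than a single name reflects the text's observation that the leak and liquid flow hypotheses can tie.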
Chapter 9
Examples



PHINEAS has been used to hypothesize explanations of observations from a variety of
physical domains and situations. Previous chapters have focused on its investigations of
evaporation, via examples of alcohol's behavior in open and closed containers. This chapter
describes PHINEAS' operation for several additional examples. It also analyzes the success
and limitations of each example.

9.1 Caloric Theory of Heat Flow 

The view of heat as a material substance originates from the Greeks and dominated thermal
science during the eighteenth and first half of the nineteenth centuries (Roller, 1961). During
the eighteenth century, it developed into what is generally called the caloric theory of
heat, which postulates a heat substance called caloric. The temperature of an object was
believed proportional to the amount of caloric present. Furthermore, caloric tended toward
equilibrium, causing it to flow between bodies placed in contact until an equilibrium of their
temperatures was achieved. This section describes how PHINEAS achieves a naive level of
the caloric view when shown thermal behavior for the first time.

The explanation task is illustrated in Figure 9.1. When a hot brick is immersed in cold 
water, their temperatures will asymptotically approach each other until reaching equality. 

PHINEAS begins by searching memory for analogous behavior.^ First, the behavioral
abstractions describing the observation are used to probe memory. In this case,
dual-approach-finish applies, which characterizes two quantities asymptotically
approaching each other and reaching equality. Only one candidate analogue demonstrates

^PHINEAS' default domain theory includes three thermal processes: heat flow, boiling, and
heat-replenish. They are removed for this example.
(Physob brick)
(Solid brick)
(Volume-solid brick)
(Liquid water1)
(Contained-liquid water1)
(Container-of water1 bucket)
(Substance-of water1 water)
(Immersed-in brick water1)
(Contained-in water1 bucket)
(Dual-approach-finish 2-obj-hf)
(Meets (Situation 2-obj-hf-sit0
          (Set (Decreasing (Temperature-in brick))
               (Increasing (Temperature-in water1))
               (Greater-Than (A (Temperature-in brick))
                             (A (Temperature-in water1)))))
       (Situation 2-obj-hf-sit1
          (Set (Constant (Temperature-in brick))
               (Constant (Temperature-in water1))
               (Equal-to (A (Temperature-in brick))
                         (A (Temperature-in water1))))))

Figure 9.1: An unexplained thermal situation. When a hot brick is immersed in cold water,
the brick's temperature decreases and the water's temperature increases. This transitions
to a state in which the temperatures are constant and equal.



this abstract behavior: 2-container-liquid-flow. This scenario describes liquid flowing from
one container (beakerS) to another (vial2), through a pipe (pipe1) connecting them. Using
SME to compare the current and recalled situations, PHINEAS determines that the roles
of the beaker and vial in the liquid flow description correspond to the roles of the brick
and water in the thermal situation, respectively. Additionally, it finds that pressure in
the liquid flow situation corresponds to temperature in the thermal situation.

Upon completion of access, PHINEAS attempts to map the relevant liquid flow domain
theory into the current thermal situation. First, the domain theory used to explain the
2-container-liquid-flow experience is retrieved. This consists of the liquid flow process and
two instantiations of

(Contained-Fluid contained-fluid substance container)

one for the beaker water and one for the vial water. The partial mapping established during
access is then declared and SME invoked. SME's results are shown in Figure 9.2. Its candidate
inferences propose a new contained-fluid relationship, in which the temperature of the
container (brick and water1) is proportional to the amount of substance it contains. This
Weight: 3.0710
Candidate Infs:

Definition of (CONTAINED-FLUID (:SKOLEM CS-WATER-BEAKER) (:SKOLEM WATER) BRICK):
  (SET (PHYSOB (:SKOLEM CS-WATER-BEAKER))
       (CONTAINER-OF (:SKOLEM CS-WATER-BEAKER) BRICK)
       (SUBSTANCE-OF (:SKOLEM CS-WATER-BEAKER) (:SKOLEM WATER))
       (QUANTITY (AMOUNT-OF (:SKOLEM CS-WATER-BEAKER)))
       (QUANTITY TEMPERATURE-IN-118)
       (QPROP TEMPERATURE-IN-118 (AMOUNT-OF (:SKOLEM CS-WATER-BEAKER))))

Definition of (CONTAINED-FLUID (:SKOLEM CS-WATER-VIAL) (:SKOLEM WATER) WATER1):
  (SET (PHYSOB (:SKOLEM CS-WATER-VIAL))
       (CONTAINER-OF (:SKOLEM CS-WATER-VIAL) WATER1)
       (SUBSTANCE-OF (:SKOLEM CS-WATER-VIAL) (:SKOLEM WATER))
       (QUANTITY (AMOUNT-OF (:SKOLEM CS-WATER-VIAL)))
       (QUANTITY TEMPERATURE-IN-117)
       (QPROP TEMPERATURE-IN-117 (AMOUNT-OF (:SKOLEM CS-WATER-VIAL))))

(PROCESS-DEFINITION (:SKOLEM LIQUID-FLOW) (:SKOLEM PI1)
  (IMPLIES
    (AND (INDIVIDUAL (:SKOLEM WATER)
           (CONDITIONS (SUBSTANCE (:SKOLEM WATER)) (LIQUID (:SKOLEM WATER))))
         (INDIVIDUAL BRICK (CONDITIONS (CAN-CONTAIN BRICK (:SKOLEM WATER))))
         (INDIVIDUAL (:SKOLEM CS-WATER-BEAKER)
           (CONDITIONS (CONTAINED-FLUID (:SKOLEM CS-WATER-BEAKER) (:SKOLEM WATER) BRICK)))
         (INDIVIDUAL WATER1 (CONDITIONS (CAN-CONTAIN WATER1 (:SKOLEM WATER))))
         (INDIVIDUAL (:SKOLEM CS-WATER-VIAL)
           (CONDITIONS (CONTAINED-FLUID (:SKOLEM CS-WATER-VIAL) (:SKOLEM WATER) WATER1)))
         (INDIVIDUAL (:SKOLEM PIPE1)
           (CONDITIONS (FLUID-PATH (:SKOLEM PIPE1))
                       (PHYSICAL-PATH BRICK WATER1 (:SKOLEM PIPE1))))
         (FLUID-ALIGNED (:SKOLEM PIPE1))
         (GREATER-THAN (A TEMPERATURE-IN-118) (A TEMPERATURE-IN-117))
         (GREATER-THAN (A (AMOUNT-OF (:SKOLEM CS-WATER-BEAKER))) ZERO))
    (AND (QUANTITY (FLOW-RATE (:SKOLEM PI1)))
         (Q= (FLOW-RATE (:SKOLEM PI1)) (- TEMPERATURE-IN-118 TEMPERATURE-IN-117))
         (GREATER-THAN (A (FLOW-RATE (:SKOLEM PI1))) ZERO)
         (CTRANS (AMOUNT-OF (:SKOLEM CS-WATER-BEAKER)) (AMOUNT-OF (:SKOLEM CS-WATER-VIAL))
                 (A (FLOW-RATE (:SKOLEM PI1)))))))

Figure 9.2: SMEcsm mapping from the liquid flow process instance to the hot brick in cold
water scenario (target description 2-OBJ-HF).
substance is currently unknown, but is analogous to the water in the liquid flow situation.
Additionally, it proposes a new process: when two objects of differing temperature are
connected by a physical path, the unknown substance continuously flows from the object
of higher temperature to the one of lower temperature, at a rate equal to their difference
in temperatures.

The candidate inferences are then passed to the transfer task. First, it determines
that none of the proposed expressions are inconsistent in their current state. Next, these
inferences are inspected for the presence of skolem objects. Four are found: (:skolem
cs-water-beaker), (:skolem cs-water-vial), (:skolem water), and (:skolem pipe1).
The first two are compound objects (i.e., objects defined solely by their constituents) and
are therefore ignored. The unknown (:skolem pipe1) indicates that no correspondent for
the pipe connecting the beaker and vial was found. However, when the abductive retriever
is given the task of solving the conjunction

(Physical-Path brick water1 ?pipe) ∧ (Fluid-Aligned ?pipe)
∧ (Fluid-Path ?pipe)

it finds that (Physical-Path brick water1 (common-face brick water1)) and (Fluid-Aligned
(common-face brick water1)) are true in the current scenario and that the third conjunct is
assumable. PHINEAS therefore establishes (common-face brick water1) as the analogue for
pipe1.
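The retrieval step just described, where two conjuncts are found in the scenario and the third is accepted as an assumption, can be sketched as a small backtracking matcher. This is a simplified stand-in for the abductive retriever, not its actual design: the functions `match` and `solve`, the tuple encoding of facts, and the idea of passing a set of assumable predicate names are all assumptions made for illustration.

```python
def match(pattern, fact, bindings):
    """Match one pattern against one fact; variables are strings starting with '?'."""
    if len(pattern) != len(fact):
        return None
    new = dict(bindings)
    for p, f in zip(pattern, fact):
        if isinstance(p, str) and p.startswith("?"):
            if new.setdefault(p, f) != f:
                return None
        elif p != f:
            return None
    return new


def solve(conjuncts, facts, assumable, bindings=None):
    """Return (bindings, assumed facts) satisfying every conjunct, or None."""
    bindings = {} if bindings is None else bindings
    if not conjuncts:
        return bindings, []
    first, rest = conjuncts[0], conjuncts[1:]
    for fact in facts:
        extended = match(first, fact, bindings)
        if extended is not None:
            result = solve(rest, facts, assumable, extended)
            if result is not None:
                return result
    # No known fact works: assume the conjunct if its predicate is assumable
    # and all of its variables are already bound.
    is_var = lambda t: isinstance(t, str) and t.startswith("?")
    if first[0] in assumable and all(t in bindings for t in first[1:] if is_var(t)):
        result = solve(rest, facts, assumable, bindings)
        if result is not None:
            final, assumed = result
            grounded = tuple(bindings[t] if is_var(t) else t for t in first)
            return final, [grounded] + assumed
    return None


face = ("common-face", "brick", "water1")
facts = [("Physical-Path", "brick", "water1", face), ("Fluid-Aligned", face)]
query = [("Physical-Path", "brick", "water1", "?pipe"),
         ("Fluid-Aligned", "?pipe"),
         ("Fluid-Path", "?pipe")]
answer = solve(query, facts, assumable={"Fluid-Path"})
```

A query whose conjuncts are neither known nor assumable returns `None`, which corresponds to the case where no correspondent is found and a new entity token must be created.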

The remaining unknown, (:skolem water), indicates that no correspondent for the
water flowing from beaker to vial was found. Additionally, no correspondent is found when
the abductive retriever is invoked on the conjunction

(Substance ?pipe) ∧ (Liquid ?pipe) ∧
(Can-Contain brick ?pipe) ∧ (Can-Contain water1 ?pipe)

When create-entity is used to make a new entity token for the missing water
correspondent, a contradiction is found:

(Liquid sk-water-1) ∧ (Volume-Solid brick) → ¬(Can-Contain brick sk-water-1)

As a result, (Liquid sk-water-1) is changed to (SK-Phase-1 sk-water-1), with SK-Phase-1
added as a new child predicate to Phase.

At this point, the transfer task is completed, resulting in the model shown in Figure 9.3.
This model postulates that the brick and water each contain sk-water-1, and their
temperatures are proportional to how much of it they contain. Additionally, it proposes the
new Process-1, which might be called a heat flow process. It indicates that sk-water-1
Hypothesis for theory 2-OBJ-HF-THEORY-1-1 derived from 2-CONTAINER-LF:

(DEFPROCESS (PROCESS-1 ?SUBST ?SOURCE ?SRC-CS ?DESTINATION ?DST-CS ?PATH)
  INDIVIDUALS ((?SUBST :CONDITIONS (SUBSTANCE ?SUBST) (SK-PHASE-1 ?SUBST))
               (?SOURCE :CONDITIONS (CAN-CONTAIN ?SOURCE ?SUBST))
               (?SRC-CS :CONDITIONS (CONTAINED-FLUID-1 ?SRC-CS ?SUBST ?SOURCE))
               (?DESTINATION :CONDITIONS (CAN-CONTAIN ?DESTINATION ?SUBST))
               (?DST-CS :CONDITIONS (CONTAINED-FLUID-1 ?DST-CS ?SUBST ?DESTINATION))
               (?PATH :CONDITIONS (FLUID-PATH ?PATH)
                                  (PHYSICAL-PATH ?SOURCE ?DESTINATION ?PATH)))
  PRECONDITIONS ((FLUID-ALIGNED ?PATH))
  QUANTITYCONDITIONS ((GREATER-THAN (A (TEMPERATURE-IN ?SOURCE))
                                    (A (TEMPERATURE-IN ?DESTINATION)))
                      (GREATER-THAN (A (AMOUNT-OF ?SRC-CS)) ZERO))
  RELATIONS ((QUANTITY (FLOW-RATE ?SELF))
             (Q= (FLOW-RATE ?SELF)
                 (- (TEMPERATURE-IN ?SOURCE) (TEMPERATURE-IN ?DESTINATION)))
             (GREATER-THAN (A (FLOW-RATE ?SELF)) ZERO))
  INFLUENCES ((CTRANS (AMOUNT-OF ?SRC-CS) (AMOUNT-OF ?DST-CS) (A (FLOW-RATE ?SELF)))))

(DEFENTITY (CONTAINED-FLUID-1 ?T-1 ?T-2 ?T-3)
  (CONTAINER-OF ?T-1 ?T-3)
  (SUBSTANCE-OF ?T-1 ?T-2)
  (QUANTITY (AMOUNT-OF ?T-1))
  (QUANTITY (TEMPERATURE-IN ?T-3))
  (QPROP (TEMPERATURE-IN ?T-3) (AMOUNT-OF ?T-1)))

(ASSUME (SUBSTANCE SK-WATER-1))
(ASSUME (SK-PHASE-1 SK-WATER-1))
(ASSUME (CAN-CONTAIN BRICK SK-WATER-1))
(ASSUME (CONTAINED-FLUID-1 SK-CS-WATER-BEAKER-1 SK-WATER-1 BRICK))
(ASSUME (CAN-CONTAIN WATER1 SK-WATER-1))
(ASSUME (CONTAINED-FLUID-1 SK-CS-WATER-VIAL-1 SK-WATER-1 WATER1))
(ASSUME (FLUID-PATH (COMMON-FACE BRICK WATER1)))

Figure 9.3: Final PHINEAS hypothesis explaining the hot brick in cold water behavior.
Figure 9.4: Envisionment produced by the hypothesized caloric model when applied to
the brick immersed in water scenario. Some states are split by QPE (e.g., S2 and S2-I).
States are distinguished only by derivative and process values. They are split when this
distinction produces a state lasting an interval of time (S2) and also lasting for an instant
(S2-I).



will flow from the object of higher temperature to the object of lower temperature. PHINEAS
does not perform generalization beyond replacing constants with variables.^ Hence, only
the brick and water1 are believed to contain sk-water-1.

Only one test remains: verifying the adequacy of the model in explaining the original
observation. As shown in Figure 9.4, the model produces a five-state envisionment, with
state S2 transitioning to state S0 demonstrating that the model is able to predict the
observed temperature changes. In state S2, Process-1 is active, the substance sk-water-1
is flowing from the brick to the water, and the temperature of the brick is decreasing while
the temperature of the water is increasing, each at a rate equal to the difference in their
temperature. In state S0, the brick and water temperatures are equal and all quantities
are constant.
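The transition from S2 to S0 can be illustrated numerically. The sketch below is an assumption-laden stand-in for the qualitative model: it replaces the qualitative proportionality between temperature and amount with a linear one (constants `k_brick` and `k_water`), uses simple Euler integration, and picks arbitrary starting amounts. The function name `simulate_caloric` is hypothetical; QPE produces an envisionment, not a numeric trajectory.

```python
def simulate_caloric(amount_brick, amount_water, k_brick=1.0, k_water=1.0,
                     dt=0.01, steps=2000):
    """Move a conserved substance from the hotter body to the cooler one.

    Temperature is taken to be k * amount (an assumed linear form), and the
    flow rate equals the temperature difference while the source is hotter.
    Returns the list of (brick temperature, water temperature) samples.
    """
    history = []
    for _ in range(steps):
        t_brick, t_water = k_brick * amount_brick, k_water * amount_water
        history.append((t_brick, t_water))
        flow = max(0.0, t_brick - t_water)  # process inactive once temperatures meet
        amount_brick -= flow * dt
        amount_water += flow * dt
    return history


trajectory = simulate_caloric(amount_brick=8.0, amount_water=2.0)
```

Under these assumptions the brick temperature falls and the water temperature rises, with both approaching a common value asymptotically, which is the dual-approach-finish behavior the observation describes.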

^The explanation-based learning community has shown that this is not sufficient to ensure proper
generalization (DeJong & Mooney, 1986; Mitchell et al., 1986). Explanation-based generalization should
be performed at this point, but this has not been a problem so far.






9.1.1 Discussion 

This example demonstrates several points. First, identifying the path of thermal flow shows
the utility of transfer in filling out an incomplete analogical mapping. An analogy will often
evoke additional reminding or perspectives made relevant by its consideration. Second, the
example illustrates PHINEAS' ability to create new object tokens (i.e., sk-water-1) when a
skolem object produced by mapping cannot be resolved. Further, it is able to distinguish
between assuming the presence of an unobserved object and conjecturing a theoretically
novel entity. This is important information for theory evaluation and selection processes.

The example also points to an interesting aspect of PHINEAS' behavior. As in any
knowledge-intensive approach to learning, PHINEAS' results are dependent on the knowledge it has
and how that knowledge is expressed. In the model shown in Figure 9.3, two questionable
relations appear:

(Fluid-Path ?path) and (Fluid-aligned ?path)

These are odd predicates for a theory of heat. Ideally, we would like to see both replaced 
by new predicates representing the concepts of heat path and heat aligned. However, they 
are consistent with the current example, since ?path is the common surface between the 
brick and water in this instance. 

There are several ways in which the preferred result could have been achieved. First,
if PHINEAS had been asked to explain a situation where a solid metal bar was acting as
the path, its inability to act as a fluid path would have been detected and new path
predicates created. Second, the current consistency of the two predicates may be viewed
as a flaw in the domain theory. Using Fluid-Path as a one-place predicate and stating
it as a precondition to Process-1 has detached its meaning from the particular fluid it
is to transport. Finally, important experiential information is lacking, since this is the
first and only time PHINEAS has encountered thermal flow. The plausibility of PHINEAS'
conjectures must be evaluated with respect to the information it possesses. If sk-water-1
is thought to be a fluid, there is no evidence indicating that the sk-water-1 fluid requires
something different than standard fluid paths (e.g., heat flow through a solid metal bar).
Often, several examples of a phenomenon are necessary to isolate the conditions in which
it occurs.



9.2 Oscillation 

Oscillation is a common phenomenon in physical systems. PHINEAS' initial knowledge
contains theories about a prototypical spring-mass system, in which a spring is anchored

Oscillator              Period of Oscillation            Compliant Element         Inertial Element

Spring-Mass Oscillator  $T = 2\pi\sqrt{m/k}$             Spring (k)                Block's mass (m)
Torsional Pendulum      $T = 2\pi\sqrt{J/K}$             Torsional stiffness (K)   Disk's moment of inertia (J)
Cantilever Pendulum     $T = 2\pi\sqrt{m l^3 / 3EI}$     Flexural stiffness (EI)   Ball's mass (m)
L-C Electric Circuit    $T = 2\pi\sqrt{LC}$              Compliance (1/C)          Inductor (L)

Figure 9.5: Some simple harmonic oscillators. (Adapted from Shive & Weber, 1982;
Thomson, 1948)



to a wall on one end and attached to a mobile mass on the other. If the block is pulled
and then released, it will oscillate back and forth forever.* Drawing from this knowledge,
PHINEAS has been able to explain several examples of simple harmonic motion, all depicted
in Figure 9.5.

Here we consider the behavior of a torsion oscillator. PHINEAS is initially given a
description of a ball rotating while suspended by a string, and the ball's sinusoidal behavior,
represented as a cycle of eight qualitatively described temporal intervals. Each interval

* Modeling friction and resistance in oscillators is a difficult modeling problem in QP theory. Ideal
oscillators are discussed throughout this section.
Torsional Pendulum

(Connected string1 ball9)
(String string1)
(Ball ball9)
(Rotating-Object ball9)
(Twisting-Object string1)
(Sinusoidal ball-oscillating)
   ...
(Situation ball-string-s3
  (Set (Decreasing (Angular-displacement string1))
       (Decreasing (Angle ball9))
       (Increasing (Angular-Velocity ball9))
       (Less-than (A (Angular-Velocity ball9)) zero)
       (Less-than (A (Angle ball9)) zero)
       (Less-than (A (Angular-displacement string1)) zero)))
   ...

Figure 9.6: A torsional oscillator.



contains facts describing the derivatives and amounts of angle and angular velocity. In
addition, it is told that the ball is a rotating object and the string is a twisting object.

When PHINEAS probes memory for prior experience with sinusoidal oscillation, it finds
the spring-mass system. A detailed comparison of this behavior and that of the rotating
ball reveals a correspondence between the eight behavioral states of each system. This
correspondence indicates that the compressing spring corresponds to the twisting string
and the translating block corresponds to the rotating ball. Additionally, position is mapped
to angle and velocity is mapped to angular velocity, due to their similar behavior.
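
The idea of pairing quantities "due to their similar behavior" can be sketched as follows. This is a deliberately simplified illustration and not SME or PHINEAS code: it assumes each quantity is summarized by one (sign of amount, sign of derivative) pair per temporal interval, and it pairs quantities whose eight-interval histories are identical. The histories are illustrative values for an ideal oscillator, and `match_quantities` is a hypothetical helper.

```python
# One (sign of amount, sign of derivative) pair per temporal interval.
# Illustrative histories for an ideal oscillator released from a positive
# displacement; these values are assumptions, not data from the thesis.
POSITION_LIKE = [(1, -1), (0, -1), (-1, -1), (-1, 0),
                 (-1, 1), (0, 1), (1, 1), (1, 0)]
VELOCITY_LIKE = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
                 (1, 1), (1, 0), (1, -1), (0, -1)]


def match_quantities(base, target):
    """Pair each base quantity with an unused target quantity that has an
    identical qualitative history (a one-to-one mapping)."""
    mapping = {}
    for base_name, base_history in base.items():
        for target_name, target_history in target.items():
            if base_history == target_history and target_name not in mapping.values():
                mapping[base_name] = target_name
                break
    return mapping


spring_mass = {"position": POSITION_LIKE, "velocity": VELOCITY_LIKE}
rotating_ball = {"angular-velocity": VELOCITY_LIKE, "angle": POSITION_LIKE}
print(match_quantities(spring_mass, rotating_ball))
```

The actual comparison is structural rather than an exact equality test over histories, which is why the behavioral descriptions produce many alternative interpretations.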

With a behavioral correspondence established, PHINEAS fetches the domain theory used
to explain the spring-mass system. This consists of a Force process which applies the
spring's force to the attached block, a spring-mass-system object definition describing
the system's total energy and the relationship between the block's position and the spring's
displacement, and a spring object definition describing its restorative force as a function
of displacement. When the spring-mass theory is mapped into the oscillating ball situation,
transfer first examines each relation and finds no inconsistencies. The transfer phase next
checks for skolem objects in the candidate inference and finds (:skolem sm-sys). sm-sys
Hypotheses for theory BALL-OSCILLATING-THEORY-3-1 derived from SPRING-MASS-OSCILLATING.

(DEFENTITY (SPRING-MASS-SYSTEM-22 ?V-78 ?V-79 ?V-80)
  (CONNECTED ?V-79 ?V-80)
  (QUANTITY (TOTAL-ENERGY ?V-78))
  (NOT (LESS-THAN (A (TOTAL-ENERGY ?V-78)) ZERO))
  (EQUAL-TO (D (TOTAL-ENERGY ?V-78)) ZERO)
  (Q= (TOTAL-ENERGY ?V-78) (+ (KINETIC-ENERGY ?V-80) (POTENTIAL-ENERGY ?V-79)))
  (Q= (ANGULAR-DISPLACEMENT ?V-79) (ANGLE ?V-80))
  (FORCE-APPLICATION (RESTORATIVE-FORCE ?V-79) (ANGULAR-VELOCITY ?V-80)))

(DEFENTITY (SPRING-S ?V-76)
  (QUANTITY (ANGULAR-DISPLACEMENT ?V-76))
  (QUANTITY (RESTORATIVE-FORCE ?V-76))
  (QPROP- (RESTORATIVE-FORCE ?V-76) (ANGULAR-DISPLACEMENT ?V-76))
  (CORRESPONDENCE ((A (RESTORATIVE-FORCE ?V-76)) ZERO)
                  ((A (ANGULAR-DISPLACEMENT ?V-76)) ZERO))
  (QUANTITY (POTENTIAL-ENERGY ?V-76))
  (NOT (LESS-THAN (A (POTENTIAL-ENERGY ?V-76)) ZERO))
  (Q= (POTENTIAL-ENERGY ?V-76)
      (* (ANGULAR-DISPLACEMENT ?V-76) (ANGULAR-DISPLACEMENT ?V-76))))

(ASSUME (SPRING-MASS-SYSTEM-22 SK-SM-SYS-23 STRING1 BALL9))
(ASSUME (SPRING-S STRING1))
(ASSUME (FORCE-APPLICATION (RESTORATIVE-FORCE STRING1) (ANGULAR-VELOCITY BALL9)))
(ASSUME (QUANTITY (RESTORATIVE-FORCE STRING1)))

Figure 9.7: Final PHINEAS hypothesis explaining the behavior of the torsion oscillator.



is a token representing the spring-mass system taken as a whole. This compound object
token is replaced by sk-sm-sys-23, which represents the newly defined string-ball system:

(spring-mass-system-22 sk-sm-sys-23 string1 ball9)

The proposed model of the rotating ball scenario, shown in Figure 9.7, is now usable.
When the model is applied to the ball-string pair, it produces an envisionment containing
an eight-state cycle, as shown in Figure 9.8. When PHINEAS examines the envisionment,
it finds a perfect match between the observed and predicted behavior. Thus, the model is
adequate and the explanation process is completed.
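
One simple way to picture the adequacy test for a cyclic behavior is a comparison up to rotation, since an observation may begin at any point in the cycle. The sketch below is an assumption about how such a check could look, not the test PHINEAS performs; `matches_cycle` and the state labels are hypothetical.

```python
def matches_cycle(observed, predicted_cycle):
    """Return True if `observed` equals `predicted_cycle` up to a rotation,
    i.e. both describe the same cycle entered at different points."""
    n = len(predicted_cycle)
    if len(observed) != n:
        return False
    return any(
        observed == predicted_cycle[k:] + predicted_cycle[:k]
        for k in range(n)
    )


# Hypothetical labels for the eight qualitative states of one oscillation.
predicted = ["s1", "s2", "s3", "s4", "s5", "s6", "s7", "s8"]
observed = ["s3", "s4", "s5", "s6", "s7", "s8", "s1", "s2"]
print(matches_cycle(observed, predicted))
```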
Figure 9.8: Complete envisionment produced by the hypothesized torsional oscillator model.

Discussion



The behavioral descriptions used in this example are larger and significantly more
ambiguous than any other SME has been applied to (55 expressions each, producing 1,972 gmaps).
It took approximately 45 minutes for SME to compare the spring and torsion behavioral
descriptions. Although this time is large, early attempts at the example were completely
unsuccessful. After several hours, SME would exhaust the memory capacity of the machine.*
Two modifications have been crucial to the example's current success. First, the nested
representation of time described in Chapter 3 provides significant constraint on the
mapping process. Second, relaxing structural consistency to allow relational groups reduces
fragmentation, further constraining the process.



9.2.1.1 LC Circuit

The LC circuit is one of the most difficult scenarios to which PHINEAS has been applied. It
is initially given a description of the circuit's sinusoidal behavior in terms of the derivatives
and amounts of charge and current. In addition, it is told that L and C are an inductor
and a capacitor, respectively, although PHINEAS has no knowledge about inductors or
capacitors. Finally, PHINEAS is given the hint that the capacitor possesses an electrical
restorative force. Without the hint, two answers are equally plausible rather than one (i.e.,
L might possess the electrical restorative force). The hint merely serves to simplify the
discussion.

PHINEAS successfully maps position to charge, velocity to current, and the spring
and mass to C and L respectively. When the charge and current model is applied to the LC
circuit, an envisionment containing a consistent eight-state cycle is produced.

This example required a small degree of tailoring before it would work as well as the
other oscillator analogies. The reasons for this provide insight into a hard problem. The
original attempt at modeling the spring-mass system described the spring's length in terms
of its rest length and the block's current position. When mapped to the LC circuit scenario,
the quantities length and rest-length were applied to the capacitor, with no apparent
contradiction. This misses the point of the analogy entirely. It appears that a general
solution to the problem requires in-depth study of the role of length and rest-length in
the spring-mass system. For example, the spring's rest length should probably correspond
to the capacitor's point of zero voltage. The problem was resolved for the above example
by reformulating the spring-mass model: length and rest-length were removed, with the



* All examples in this thesis are taken from PHINEAS running on a Symbolics 3640 with 12 Mb RAM and
100 Mb disk swap space.
(Substance solution-stuff)
(Contained-liquid solution1)
(Container-of solution1 container1)
(Substance-of solution1 solution-stuff)
(Solution solution1)
(Contained-liquid solution2)
(Container-of solution2 container2)
(Substance-of solution2 solution-stuff)
(Solution solution2)
(Can-contain container1 solution-stuff)
(Can-contain container2 solution-stuff)
(Touching container1 common-wall)
(Touching container2 common-wall)
(Touching container1 common-floor)
(Touching container2 common-floor)
([illegible] common-floor)
(Greater-than (A (amount-of solution1)) zero)
(Greater-than (A (amount-of solution2)) zero)

(Meets (Situation osmosis-sit1
         (Set (Decreasing (Amount-of solution1))
              (Increasing (Concentration solution1))
              (Increasing (Amount-of solution2))
              (Decreasing (Concentration solution2))
              (Greater-than (A (Amount-of solution1))
                            (A (Amount-of solution2)))
              (Greater-than (A (Concentration solution2))
                            (A (Concentration solution1)))))
       (Situation osmosis-sit2
         (Set (Constant (Amount-of solution1))
              (Constant (Concentration solution1))
              (Constant (Amount-of solution2))
              (Constant (Concentration solution2))
              (Equal-to (A (Concentration solution2))
                        (A (Concentration solution1))))))

Figure 9.9: Two containers sharing a common wall of unknown substance. Each container
holds a solution.




spring force determined by the position of the block. This was sufficient to produce the
results described above.



9.3 Osmosis 

Figure 9.9 depicts two containers sharing a common wall composed of an unknown
substance. Each container holds a different amount of a solution. We observe that the amount
of solution is decreasing in the left container (container1) and increasing in the right
container (container2). At the same time, the concentration of solution1 is increasing
while the concentration of solution2 is decreasing. What is happening?

PHINEAS begins by exploring memory for potential analogues. It finds four possibilities,
but at the same time encounters significant ambiguity, as evidenced by the number of
interpretations for each analogue:

Analogue                   # interpretations    similarity score

2-container-liquid-flow    (2)                  17.7
leaking-container          (1)                  16.8
dissolving                 (2)                  14.8
boiling                    (6)                  16.4

The multiple interpretations arise for two reasons. First, there is ambiguity over whether
to map amount-of to the solution's amount-of quantity or the solution's concentration
quantity. Second, the observation describes the change of two contained liquids. Thus,
in the dissolving analogue, one interpretation views the salt mapping to solution1 while
the other views the salt mapping to solution2. A similar situation occurs for the
boiling analogue. However, the reason is of significance. The one-to-one criterion prevents
Contained-liquid from mapping to both Contained-liquid and Contained-gas.
Additionally, both gas and liquid are in the same container for the boiling scenario. This again
leads to ambiguity in mapping to container1 or container2.

PHINEAS examines each of these possible analogues and finds that none of them
completely accounts for the observed behavior. The greatest coverage comes from the
two-container-liquid-flow analogue. Its entire hypothesis consists of:

(Fluid-Path common-wall)

The envisionment for this hypothesis was shown in Figure 6.5. It consistently describes
the loss of solution1 and the rise of solution2. However, it incorrectly predicts that the
concentration of each will remain constant.

Because the theory revision module is still incomplete, and all of the hypotheses fail to
adequately explain the observation, PHINEAS concludes with a task queue filled with pending
revision tasks. For this example, I had PHINEAS return the best inadequate explanation
available. When the LEF is applied, the two container liquid flow hypothesis is selected,
due to its resting on a single assumption about the nature of the wall separating the two
containers. It is important to note that PHINEAS' work was made needlessly complex due
to the experimental settings of the task priorities. The primary point of this "osmosis"
example is to show how PHINEAS uses global similarity to focus on the most plausible
explanations first. Because of the observation's strong similarity to liquid flow, PHINEAS
developed the liquid flow hypothesis first and only afterwards did it entertain the other
possibilities.
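
The focusing effect of global similarity can be sketched as an agenda ordered by similarity score. The scores below are the ones reported for the osmosis example; the functions `rank_analogues` and `first_adequate` and the adequacy predicate are hypothetical, and the real system interleaves this ordering with its task priorities rather than using a plain sort.

```python
# Similarity scores reported for the osmosis observation.
OSMOSIS_SCORES = {
    "2-container-liquid-flow": 17.7,
    "leaking-container": 16.8,
    "dissolving": 14.8,
    "boiling": 16.4,
}


def rank_analogues(scores):
    """Order candidate analogues from most to least similar."""
    return sorted(scores, key=scores.get, reverse=True)


def first_adequate(scores, is_adequate):
    """Examine candidates in similarity order and return the first one whose
    hypothesis is judged adequate, or None if every candidate fails."""
    for name in rank_analogues(scores):
        if is_adequate(name):
            return name
    return None


print(rank_analogues(OSMOSIS_SCORES))
# In the osmosis example no hypothesis was adequate, so the search fails.
print(first_adequate(OSMOSIS_SCORES, lambda name: False))
```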
Figure 9.10: A beaker and a vial, each containing water, are connected by object3. What
is causing the water in the beaker to decrease while the water in the vial is increasing?



9.4 Ordinary Liquid Flow 

One of the goals of this work is to show the feasibility of similarity as a single source for 
both analogical and more traditional, deductive or abductive explanations. Consider the 
scenario illustrated in Figure 9.10: 

• A beaker and a vial, each containing water, are connected by object3. What
is causing the water in the beaker to decrease while the water in the vial is
increasing?

PHINEAS begins by probing memory for the best set of candidate analogues. It finds
four initial possibilities:

Similarity Score    Analogue

28.07               two container liquid flow
15.77               leaky container
14.24               dissolving
14.14               boiling

Examining each of these possibilities, PHINEAS finds that not only is the two container
liquid flow scenario most similar to the current situation, it is the only candidate that
produces a consistent set of predictions. Thus, PHINEAS concludes with the single assumption:

(Fluid-Path object3)

which is sufficient to completely explain the observed behavior (see Figure 9.11). Under
this assumption, the situation is viewed as a normal instance of liquid flow, with object3
serving as the fluid path.
Hypotheses for theory OBJECT3-FLOW-THEORY-5 derived from 2-CONTAINER-LF:

(ASSUME (FLUID-PATH OBJECT3))

Analysis of OBJECT3-LF according to theory OBJECT3-FLOW-THEORY-5

In behavioral segment 1
  (PRESSURE-IN VIAL6) is Increasing
  (AMOUNT-OF CS-WATER-VIAL1) is Increasing
  (PRESSURE-IN BEAKER6) is Decreasing
  (AMOUNT-OF CS-WATER-BEAKER1) is Decreasing
  (A (PRESSURE-IN BEAKER6)) is Greater Than (A (PRESSURE-IN VIAL6))
  Due to the following processes being active:
    LIQUID-FLOW(WATER BEAKER6 CS-WATER-BEAKER1 VIAL6 CS-WATER-VIAL1 OBJECT3)

In behavioral segment 2
  (PRESSURE-IN VIAL6) is Constant
  (AMOUNT-OF CS-WATER-VIAL1) is Constant
  (PRESSURE-IN BEAKER6) is Constant
  (AMOUNT-OF CS-WATER-BEAKER1) is Constant
  (A (PRESSURE-IN BEAKER6)) is Equal To (A (PRESSURE-IN VIAL6))
  There are no processes active.

Figure 9.11: To explain why the amount of water in the beaker is decreasing and the amount
of water in the vial is increasing, PHINEAS requires only a single assumption: object3 is a
fluid path.



9.5 Summary of examples 

PHINEAS has been tested on over a dozen examples representing variations on a set of nine
basic explanation tasks. These are summarized below.

1. Disappearing alcohol. This example has been discussed in detail, with variations
described in Section 8.4.1. Four analogues have been considered: boiling, a leaky
container, dissolving, and two container liquid flow.

2. Caloric heat flow. The transfer of heat from a warm object to an object of lower
temperature may be explained via analogy to liquid flow. PHINEAS conjectured the
existence of a new substance in the process.

3. Oscillation. Simple harmonic motion is a common occurrence. Three oscillatory
phenomena have been explained by analogy to a spring-mass system.

(a) Torsion oscillator

(b) Cantilever oscillator

(c) LC circuit

4. Osmosis. The flow of a solution's solvent through a semi-permeable membrane is 
very similar to ordinary liquid flow. PHINEAS has demonstrated that global similarity 
is useful for focusing the search for plausible explanations. Because the analogy to 
liquid flow requires revision to the standard model, PHINEAS was unable to successfully 
derive an adequate explanation. 

5. Floating beach ball. The stability of a beach ball suspended in an upward-flowing
column of air is counter-intuitive to some. This example demonstrates the power of
simulation analysis sanctioned by verification-based analogy. PHINEAS was able to
reject a proposed "pushing" model based on the model's inconsistency with
observation. At the same time, the "pulling" model drawn from Bernoulli's principle was
successfully proposed as an adequate explanation.

This example was selected because it clearly demonstrates a central intuition of
VBAL. Its actual implementation was necessarily ad hoc. The scenario possesses
a great deal of geometry and there appears to be a strong visual factor. When I
posed this scenario to various people, those giving the incorrect response would
typically describe the air as "holding the ball in" in reference to its being enveloped by
the flowing air. The models considered by PHINEAS are one-dimensional (in the
horizontal direction) and state that the force is a function of the amount of air on each
side. In turn, the amount of air is a function of position, with zero corresponding to
the center of the air column.

6. Diffusion. Osmosis and diffusion are roughly equivalent processes. When PHINEAS is
supplied with full knowledge of osmosis for liquids, it is able to successfully explain
diffusion for gases. Again, global similarity is a source of focus. Osmosis is more
similar to diffusion than liquid flow.

7. Ordinary liquid flow. PHINEAS' starting body of knowledge and experiences contains
different examples of the basic concept "liquid flow". As in all explanations it pro- 
duces, PHINEAS explains new liquid flow observations by way of their strong similarity 
to the standard liquid flow scenario prototype. This is in contrast to most explana- 
tion and qualitative reasoning systems, which examine a theory's preconditions for 
applicability. Two different liquid flow scenarios have been tested: 
(a) Object3 path. This was described in Section 9.4.

(b) 3 container liquid flow. This example demonstrates an important difference
between PHINEAS' analogy architecture and more common instantiation methods.
Three containers connected in series (can1 to can2 by pipe12 and can2 to can3
by pipe23) produce two simultaneous instantiations of the liquid flow process.
When PHINEAS is given a description of "3 container liquid flow", it finds two
different analogies with the potential analogue, "2 container liquid flow". One
analogy is with the can2 to can1 flow, the other is with the can2 to can3 flow.
These are processed as two independent candidate explanations. Additionally,
they are both adequate, since QPE fortuitously applies the proposed liquid flow
model (intended for the can2 to can1 pair, for example) to both container pairs.
Thus, PHINEAS concludes with two adequate, functionally equivalent
explanations. It does not possess the knowledge that a single phenomenon was simply
occurring twice. This "generalization to N" problem (e.g., Shavlik, 1987) is an
important case which research in analogy has failed to address.

8. Floating. The lift a helium-filled balloon possesses is analogous to an object floating
in water. The net effect of this analogy is that PHINEAS generalizes Archimedes'
principle from liquids to fluids. However, since PHINEAS does not possess a final
storage mechanism, it ends up with two versions, one for gases and one for liquids.

9. Hot cup. Kedar-Cabelli (1988) describes an example in which the task is to explain
why a ceramic cup may be classified as an instance of the concept "hot cup". An
analogy to a styrofoam cup, a known type of hot cup, is used to assist the proof
process. The styrofoam cup explanation is represented as a single horn clause justifying
hot-cup, with intermediate justifications compiled away through explanation-based
generalization. In the version given to PHINEAS, the ceramic cup was replaced by
a metal cup to demonstrate the importance of contextual factors in determining
semantic similarity. A mapping technique which used an ISA hierarchy would map
styrofoam to metal, given any standard hierarchy organization. This is the wrong
mapping. In the context of the styrofoam explanation, styrofoam is satisfying the
role of insulator. Its analogue for the metal cup is has-handle. By inspecting the
styrofoam cup explanation's cache, PHINEAS is able to make the correct mapping.
Recall that minimal ascension is only used if no functional analogue can be found.
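
The preference for a functional analogue over an ISA-based match in the hot cup example can be sketched as a two-stage lookup. This is an illustration under stated assumptions, not the PHINEAS implementation: the role tables, the role name given to metal, and the function `map_feature` are hypothetical, and the sketch assumes has-handle is recorded as filling the same role that styrofoam fills in the cached explanation.

```python
def map_feature(base_feature, base_roles, target_roles, isa_fallback):
    """Map a base feature to a target feature.

    First look for a target feature playing the same role in the explanation
    (a functional analogue). Only if none exists, fall back to an ISA-based
    match supplied by `isa_fallback`.
    """
    role = base_roles.get(base_feature)
    if role is not None:
        for target_feature, target_role in target_roles.items():
            if target_role == role:
                return target_feature, "functional"
    return isa_fallback(base_feature), "minimal-ascension"


# Hypothetical role assignments drawn from the cached explanation.
styrofoam_cup_roles = {"styrofoam": "insulator"}
metal_cup_roles = {"metal": "material", "has-handle": "insulator"}

# An ISA hierarchy alone would pair the two materials.
isa_guess = {"styrofoam": "metal"}.get

print(map_feature("styrofoam", styrofoam_cup_roles, metal_cup_roles, isa_guess))
```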
Chapter 10
Discussion 



There were two primary motivating goals for this thesis. First and foremost, develop 
a general, working model of analogical processing that is powerful enough to perform a 
complex reasoning task. This I feel has been accomplished. While PHINEAS is certainly not 
task-independent, I believe the model it embodies has broad utility. Second, apply that 
model to the task of providing qualitative explanations for observed physical phenomena, 
using representation and analysis techniques from the state-of-the-art in qualitative physics 
to demonstrate scale and generality. This too I feel has been accomplished to a large extent.
PHINEAS is able to offer analogically derived explanations for a variety of observations
novel to its initial domain theories. It is able to envision the predictions of a developing 
explanation, from which the explanation's adequacy may be tested and revisions sanctioned. 

Embedding the model in a non-trivial performance task is an important methodolog- 
ical constraint for research in machine learning. Learning should provide a service, such 
as improving system performance or enabling solutions to problems otherwise unsolvable. 
Techniques developed in support of complex inferences have fewer arbitrary choices than 
techniques developed specifically for learning research. Additionally, placing learning
research in the context of a serious performance task often leads to discovery of fundamental
research problems that would have otherwise gone unnoticed. This was certainly true
in the development of PHINEAS. To my dismay, I was constantly surprised by problems
that, for whatever reason, had not been described in the literature. These include the
N-M binding problem, the structure rearrangement problem, and the many ways in which
matching predicates based on an ISA hierarchy fails. Conversely, the learning research
may help to motivate research in the application task domain. For example, my
motivation for the work on large-scale, multiple-perspective qualitative modeling described in
(Falkenhainer & Forbus, 1988) arose from frustration in attempting to model analogical 
access for commonsense physical domains. 
Despite the progress made, far more remains undone. This chapter begins by reviewing
the results of this work. It then compares the work to related research from different aspects
of AI. Finally, a number of important open research problems are discussed.



10.1 Summary of Results 

In summary, the primary contributions of this thesis are: 

• It presents verification-based analogical learning, a view of analogy as an iterative
process of hypothesis formation, testing, and revision. This results from the
verification requirement, which demands that an analogy system analyze the conjectures it
makes and attempt to repair them when insufficient.

• It presents contextual structure-mapping, an adaptation of Gentner's structure-mapping
theory which stresses the need to use context and knowledge about the
representations being manipulated as an aid in the mapping process. This has three primary
components:

1. Pairwise mappability of predicates is context sensitive. An implemented
definition of functionally analogous was presented which solves a specialization of this
more general problem. Specifically, two expressions are functionally analogous if 
they provide the same inferential support in the context of the structures being 
mapped. 

2. The structure rearrangement problem for purely structural models (e.g., SMT) 
was identified, and domain-specific mapping constraints were defined to prevent 
the problem. Additionally, the structural consistency constraint of SMT was
weakened to allow for the mapping of relational groups. 

3. The selection criterion for mapping is generalized to include contextual
relevance in addition to systematicity. This overcomes the tendency of the pure
systematicity criterion to focus on coherent but irrelevant interpretations.

• It describes how contextual structure-mapping may be modeled as a set of match
rules given to SME, a general, domain-independent computational model of analogical
mapping that has been tested on many examples from numerous domains and in
several research projects.

• The transfer operation is formalized in the context of explanation. Transfer is
concerned with importing candidate inferences proposed by mapping into the target
domain and making them operational. This centers around ensuring expression
consistency and resolving skolem objects.

• Transfer and mapping have been paired to form a map and analyze cycle. Mapping 
provides focus by globally identifying analogical commonalities between base and
target descriptions. This enables transfer to focus analysis on the candidate infer- 
ences, which represent either base information inferable for the target or points of 
discrepancy in the analogical comparison. 

• It describes an approach to the problem of analogical access across different domains, 
which requires retrieval over non-identical features. Shared behavioral abstractions 
are used to focus attention on a potentially relevant subset of memory. Each experi- 
ence in this subset is then compared at a detailed level to the current target situation 
to determine if they share a common relational structure, despite their surface-level 
differences. 

• It introduces a performance-based measure for evaluating the adequacy of an
analogically proposed model. In PHINEAS, this centered around the use of qualitative
simulation as a form of gedanken experiment to test the predictions of a model against
observation.

• It describes an approach to theory revision based on the view that generation of revi- 
sion hypotheses should not be fundamentally different from generation of the original 
explanatory hypotheses. It proposes the use of three sources of knowledge. First 
principles analysis provides completeness in suggesting possible revisions. Precedent- 
guided revision uses knowledge of analogous phenomena to provide a focused, ordered 
set of hypotheses. Finally, difference-based reasoning examines prior successes to see 
what is different about the current situation that may provide empirical explanations 
for hypothesis plausibility. 

• It describes PHINEAS, a fully implemented system based on the proposed analogical 
explanation architecture. PHINEAS posits qualitative explanations for time-varying
descriptions of physical behaviors. 
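
The iterative view summarized in this list (hypothesis formation, testing, and revision) can be sketched as a small control loop. This is a toy illustration under stated assumptions, not the PHINEAS architecture: hypotheses and observations are plain sets of facts, `simulate` and `revise` are caller-supplied stand-ins for qualitative simulation and theory revision, and the name `explain` and the `max_revisions` bound are hypothetical.

```python
def explain(observation, hypotheses, simulate, revise, max_revisions=3):
    """Return the first hypothesis whose simulated predictions cover the
    observation, revising an inadequate hypothesis up to `max_revisions`
    times before moving on to the next candidate. Returns None on failure."""
    for hypothesis in hypotheses:
        for _ in range(max_revisions + 1):
            predicted = simulate(hypothesis)
            missing = observation - predicted
            if not missing:
                return hypothesis          # adequate: predictions cover the observation
            hypothesis = revise(hypothesis, missing)
            if hypothesis is None:
                break                      # no revision available for this candidate
    return None


# Toy stand-ins: a hypothesis predicts exactly the facts it contains, and
# revision simply adds whatever was missing.
observed = {"amount decreasing", "concentration increasing"}
candidates = [frozenset({"amount decreasing"})]

result = explain(observed, candidates, simulate=set, revise=lambda h, m: h | m)
print(sorted(result))
```

With a `revise` function that always returns None, the same call returns None, which corresponds to concluding with revision tasks still pending.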

Based on these ideas and my experience with PHINEAS, I have also proposed that
interpretation-construction tasks may be viewed as the search for maximal, explanatory
similarity between the situation being explained and some explainable scenario. This led
to the conjecture that analogy suffices as the central process model for explanation tasks,
with distinctions between deductive, abductive, and analogical explanations arising out
of the evaluation process. While PHINEAS demonstrates the feasibility of this view, much
further research will be required to prove this conjecture.

10.2 Related Research

This thesis covers a range of AI concerns, including analogy, scientific theory formation, 
and qualitative reasoning. This section identifies important similarities and differences with
related research from these various disciplines. 

10.2.1 Analogy 

Analogy has many facets, making it difficult to thoroughly review all of the work from
philosophy, psychology, and AI which might overlap some portion of this research. In the
following subsections, a select subset of work in computational analogy most relevant to
this thesis is discussed. One should also refer to (Falkenhainer et al., 1987) for a thorough
survey of work comparable to SME and analogical mapping. For example, Winston's (1980,
1982) highly influential work and its relation to SME is discussed. See also (Hall, to appear)
and (Kedar-Cabelli, 1985a) for general surveys of analogy.

10.2.1.1 Burstein's CARL

Burstein (1983, 1985) presents a detailed investigation of how a student may be taught
about assignment statements in BASIC through a series of analogies across multiple
domains. CARL interacts with a tutor, who identifies important similarities between
variables and boxes, algebraic equality, and human memory. One of CARL's important features
is its ability to use multiple analogies in constructing a model of BASIC assignment
statements. An analogy with boxes is used to provide a sense of "placing a number inside a
variable". Algebraic equality is used to repair the conception that "X = Y" places "Y"
inside X, rather than the value for Y. Finally, a human metaphor is used to express the
feel of a computer as a communicative agent.

The minimal ascension mapping rule (Chapters 3 and 5) is adapted from Burstein's 
method for creating a new predicate when a base relation is inconsistent for the target 
domain. The minimal ascension technique adds two functions. First, it may be used 
during mapping to allow an explicit match between non-identical predicates. This match is 
inversely proportional to the distance of the predicates paired and enables direct comparison 
of alternate pairings. Second, it is used during transfer to seek existing predicates, not only 
to create new predicates. 
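
The matching use of minimal ascension can be illustrated with a small sketch. The
predicate hierarchy, the function names, and the 1/(1 + distance) scoring formula below are
all assumptions made for illustration; the thesis's actual hierarchy and scoring function
are not reproduced here. The sketch only shows the idea that two non-identical predicates
are paired through their nearest common ancestor and that closer pairs score higher.

```python
# Hypothetical predicate hierarchy: child -> parent (None marks the root).
PARENT = {
    "relation": None,
    "transfer": "relation",
    "flow": "transfer",
    "fluid-flow": "flow",
    "heat-flow": "flow",
    "dissolve": "transfer",
    "greater-than": "relation",
}


def ancestors(pred):
    """Return [pred, parent, grandparent, ...] up to the root."""
    chain = []
    while pred is not None:
        chain.append(pred)
        pred = PARENT[pred]
    return chain


def minimal_ascension(base_pred, target_pred):
    """Return (nearest common ancestor, total steps climbed to reach it)."""
    base_chain = ancestors(base_pred)
    target_chain = ancestors(target_pred)
    best = None
    for i, ancestor in enumerate(base_chain):
        if ancestor in target_chain:
            j = target_chain.index(ancestor)
            if best is None or i + j < best[1]:
                best = (ancestor, i + j)
    return best


def match_score(base_pred, target_pred):
    """Illustrative score that falls off with distance; identical predicates score 1.0."""
    _, distance = minimal_ascension(base_pred, target_pred)
    return 1.0 / (1.0 + distance)
```

Under these assumptions, pairing fluid-flow with heat-flow (common ancestor flow) scores
higher than pairing fluid-flow with dissolve (common ancestor transfer), which is what
allows alternate pairings to be compared directly.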



Some of Burstein's work may be viewed as important next steps in the development
of PHINEAS. Specifically, Burstein has focused on multiple analogies, in which a working
hypothesis about a situation arises through integration of alternate sources of information.
Consider the disappearing alcohol example. From a dissolving analogue, PHINEAS was able
to hypothesize that the rate of disappearance is proportional to the alcohol's surface area
in contact with the atmosphere. From a boiling analogue, PHINEAS was able to tap its
knowledge about the vaporization process. However, the program was not able to merge
these hypotheses about different aspects of the situation to form a more complete model
of the situation. Flexible integration of multiple sources of information has always been
part of the desiderata for PHINEAS, but required the development of the current PHINEAS
system as a prerequisite.

10.2.1.2 Derivational Analogy

Analogical problem solving seeks to solve a current problem situation by recourse to a previous
problem solving episode. Two distinct approaches have been presented by Carbonell:
transformational analogy (1983b) and derivational analogy (1983a). Each recognizes that
applying old solutions to new problems may involve modification to the original solution.
Transformational analogy applies transformation operators to a base solution until it solves
the current problem. There are a number of problems with this approach, as Carbonell has
noted, and derivational analogy was proposed in response. Specifically, mapping only final
plans or operator sequences hides important planning information used to derive the
sequence. For example, translating a quicksort routine from PASCAL to LISP by direct
translation would be far more difficult than reusing the abstract design process used in
developing the original PASCAL version (Carbonell, 1983a).

Derivational analogy replays the derivational history of a previously solved problem.
The derivation may contain a record of all important decisions made in the course of solving
the base problem, including the alternate plans and subgoals considered, justifications for
successful steps, reasons behind failed steps, and any other constraints influencing the
development of the final solution. The target problem is solved by examining each step
in the derivational history. If the justifications for that step hold in the current situation,
the step is applied. If the justifications are missing, then alternate support for the step is
sought, one of the alternate choices considered in the derivation is applied, or some form
of subgoaling is invoked.
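
The replay loop just described might be sketched as follows. This is a minimal illustration
under stated assumptions, not Carbonell's implementation: the Step record, the replay
function, and the example facts are hypothetical, justifications are reduced to sets of
facts, and the search for alternate support for a step is omitted (the sketch goes straight
to recorded alternatives and then to subgoaling).

```python
from dataclasses import dataclass, field


@dataclass
class Step:
    action: str
    justifications: frozenset  # facts that supported this step in the base problem
    alternatives: list = field(default_factory=list)  # other Steps considered here


def replay(history, target_facts):
    """Replay a stored derivation against the facts of the target problem."""
    plan = []
    for step in history:
        if step.justifications <= target_facts:
            plan.append(step.action)  # justification still holds: reuse the step
            continue
        # Otherwise try an alternative choice recorded in the derivation.
        for alt in step.alternatives:
            if alt.justifications <= target_facts:
                plan.append(alt.action)
                break
        else:
            plan.append(("subgoal", step.action))  # nothing applies: subgoal on it
    return plan


# Hypothetical derivation of an array-based sort, replayed for a list-based target.
history = [
    Step("pick-middle-pivot", frozenset({"random-access"}),
         [Step("pick-head-pivot", frozenset({"linked-list"}))]),
    Step("partition-by-comparison", frozenset({"comparable-elements"})),
    Step("swap-in-place", frozenset({"mutable-array"})),
]
target_facts = frozenset({"linked-list", "comparable-elements"})
plan = replay(history, target_facts)
```

In this example the first step is replaced by its recorded alternative, the second is reused
unchanged, and the third becomes a subgoal because neither it nor any alternative is
justified in the target.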

An important contribution of Carbonell's proposal is the observation that the mapping
process as commonly described in matching models of analogy (e.g., Winston, 1980;
Gentner, 1983) is overly simplistic for complex analogical problem solving. That is, mapping
may often involve manipulation of the original base description in response to the
current target environment. Replay is one approach that enables a base situation to be
adapted to the current target instance. In Chapter 5, I proposed that this is an implicit
specialization of a more general problem in analogy - the general need for knowledge about
the structures being manipulated in the mapping process. The definition of functionally
analogous is an attempt to address this problem, but is still a specialization of a much
larger problem.

As a proposal, derivational analogy leaves many questions unanswered. First, what
information should be stored in the derivational history? Clearly, remembering everything
about every problem solving experience is prohibitive. If the derivation is too detailed,
analogical reasoning could be more time consuming than solving the problem without
analogy. Second, the model lacks an explicit account of similarity, particularly important for
mapping solutions across domains. As described, derivational analogy is most appropriate
when a nearly equivalent problem has been solved in the past. Finally, derivational analogy
is based purely on the problem solving paradigm and does not attempt to represent a model
for analogy in general. For instance, it has little to say about analogical learning.

10.2.1.3 Kedar-Cabelli's PER

Kedar-Cabelli (1988) describes a method for purposive explanation replay, PER. The problem
PER addresses is, given an existing plan schema and an old individual that filled one
of the roles in the schema, find an individual in the current state that may be substituted
for that role and still achieve the plan. Like derivational analogy, PER is a replay method
for mapping, which attempts to adapt an old plan for a new situation by rejustifying and
replanning around portions of the old plan that are not supported by the current situation.
PER is like the functionally analogous portion of PHINEAS' transfer process in that both attempt
to find alternate inferential support for unsupported portions of a proposed analogical
inference. PHINEAS is an extension of PER in several ways. First, PER's implementation,
REPeat, guesses the functional role of an unsupported item by retrieving a Horn clause in
which the item appears as an antecedent, followed by reproving the consequent. Since a
given item may support many different types of consequents, this is a rough heuristic. In
PHINEAS, the cache slot serves the same purpose, but explicitly states the actual reason
why the unsupported item was needed in the base description. Second, since PER picks a
candidate base analogue at random (each goal is associated with past planning instances
that have solved it in the past), it may spend a great deal of time reproving an analogue
that violates PER's near miss assumption. PHINEAS uses global similarity throughout its
reasoning process, enabling it to focus first on the optimal candidate analogue (optimal in
the sense of maximal match). The map and analyze cycle further focuses the process by
maintaining a global perspective of what has and has not matched between the two situations.
Locating one missing correspondent may fortuitously uncover other correspondents
due to their interrelatedness. Analyzing each missing correspondent one at a time may fail
to detect such discoveries. Finally, PER is designed to generate d-sound inferences only. Of
course, this is important for planning, in which the system should attempt to ensure that
the proposed plan will succeed. However, it requires that all relevant knowledge be available
and is limited to non-learning situations. PHINEAS' mechanisms for seeking assumable,
alternate relations may be seen as an attempt to alleviate this problem.

10.2.1.4 Greiner's NLAG 

Greiner's (1986, 1988) work, like this thesis, addresses the problem of analogy in the context
of i-sound inferences. Given an analogical hint "A is like B" ($A \sim B$), a target problem $PT$,
and a theory $Th$, his NLAG program uses the abstraction-based useful analogical inference
operator $\mid\!\sim$ to propose a solution for $PT$ in the target domain A. $\mid\!\sim$ is defined as:

\[ Th,\; A \sim B \;\mid\!\sim_{PT}\; \varphi(A) \]

where  Unknown:     $Th \not\models \varphi(A)$
       Consistent:  $Th \not\models \neg\varphi(A)$
       Common:      $Th \models \varphi(B)$
       Useful:      $Th \cup \{\varphi(A)\} \models PT$

NLAG begins with a problem statement (e.g., "find the flowrate through the stem of
water pipes forming a Y-junction"), an impoverished theory unable to solve the problem,
and an analogical hint (e.g., "flowrate is like current"). NLAG then searches through its
known set of abstractions which refer to current and are able to solve a base rendition of
the target problem. The analogy process is primarily concerned with reinstantiating the
base abstraction for the target problem, such that it is both consistent and useful.
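
The four conditions on a candidate conjecture (unknown, consistent, common, useful) can
be checked mechanically in a toy setting. The sketch below is an assumption-laden
illustration and not NLAG itself: entailment is approximated by forward chaining over
propositional Horn rules, negation is represented by a "not " prefix on a fact string, and
the facts, the rule, and the conservation property are hypothetical stand-ins for the
flowrate/current example.

```python
def closure(facts, rules):
    """Forward-chain Horn rules (premises, conclusion) to a fixed point."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and premises <= known:
                known.add(conclusion)
                changed = True
    return known


def check_conjecture(facts, rules, phi_a, phi_b, problem):
    """Evaluate the four conditions for the conjecture phi(A), given phi(B)."""
    th = closure(facts, rules)
    return {
        "unknown": phi_a not in th,                 # Th does not already entail phi(A)
        "consistent": ("not " + phi_a) not in th,   # Th does not entail its negation
        "common": phi_b in th,                      # phi holds of the base B
        "useful": problem in closure(set(facts) | {phi_a}, rules),
    }


# Hypothetical theory: the property is known for current but not for flowrate.
facts = {"conserved-at-junction(current)", "y-junction(pipes)"}
rules = [
    (frozenset({"conserved-at-junction(flowrate)", "y-junction(pipes)"}),
     "stem-flowrate-solved"),
]
result = check_conjecture(
    facts, rules,
    phi_a="conserved-at-junction(flowrate)",
    phi_b="conserved-at-junction(current)",
    problem="stem-flowrate-solved",
)
```

With these assumed inputs, all four conditions hold, and the target problem is not
derivable from the impoverished theory alone.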

From a strictly practical level, NLAG is one of the few domain-independent analogical
learning systems demonstrated on a series of non-trivial tasks. From a more theoretical
level, Greiner's work provides important first steps in formalizing analogy. He states two
important constraints on the analogy process. First, it establishes consistency as a central
constraint. Second, it proposes utility as an additional constraint on analogical inference.
This is a predecessor to the performance measure of inference adequacy described in Chapter 6.

However, Greiner's work places excessive constraints on the analogy process. First, the
consistent requirement forces analogical inference to be monotonic. Second, the theory
views analogy as a form of instantiation over extensionally defined schemas or abstractions.
Thus, much of the answer is given before the process starts. Finally, abstractions must apply
without change. This does not take into account the potential adaptation required when
moving from one situation or domain to another.

10.2.1.5 Holyoak and Thagard's ACME 

Since the first publication of SME (Falkenhainer et al., 1986), Holyoak and Thagard (1988a)
have developed ACME, a program which places much of the SME design into a connectionist
relaxation framework. Specifically, rather than the three merge steps used in SME, ACME
feeds its local match hypotheses into a lateral-inhibition, spreading-activation network
(figuratively equal to Figure 3.4). This eliminates the potentially expensive merge steps
by replacing them with a connectionist relaxation technique that is able to produce a
"best" interpretation without explicitly generating all interpretations. By using a relaxation
network, their constraints on mapping are not absolute, but rather provide "pressures"
that guide the emergence of a global mapping. This is very elegant. On the other hand,
they cite SME's development of absolute, alternate interpretations as a limitation that ACME's
pressures are able to avoid. This conclusion needs further consideration. Unlike in SME, ACME's
pressures are not governed by global context to ensure that purely local considerations do
not produce global incoherency. For example, this "feature" would cause ACME to generate a
single interpretation in mapping one Necker cube to another, since many-to-many mappings
are allowed if there is equal evidence supporting each interpretation.

10.2.2 Explanation and Theory Formation

In addition to analogy, this work is relevant to research in explanation and scientific
discovery. The following subsections examine this relationship.

10.2.2.1 Interpretation and qualitative reasoning 

PHINEAS is an interpretation system, ascribing plausible causal explanations to an observed
behavior. Prior models of causal analysis reason from a fixed set of axioms to provide an
explanation of a physical system's behavior. They assume knowledge of all relevant physical
processes and the complete structural description required to apply these processes to the
current situation.

One such method is ATMI (Forbus, 1986a; DeCoste, 1989).* The ATMI method approaches
the interpretation problem in a purely forward-going manner: determine everything
that could possibly happen given the current physical configuration and then see if
the observation corresponds to one of those predictions. This has two problems. First,
predicting everything that could possibly happen for a specified physical setting is prohibitive
for large problems. Incremental envisioning, in which predictions are generated
incrementally in response to observation, appears to be a potential solution for this problem
(DeCoste, 1989). Second, if the physical configuration is not fully specified, the set of
predictions with which to compare will be empty unless assumptions can be made about
the scenario. For a forward-going mechanism, this requires a priori specification of what
may be assumed, which seems untenable if generality is desired. Specifically, the ATMI
approach has no mechanism for dictating model selection once assumptions are allowed.
Placing PHINEAS on top of an ATMI approach has the appealing characteristic of providing
the focus common to backward-going abductive systems, while retaining ATMI's concern
for the problems of numeric data analysis and real time processing. PHINEAS is able to focus
model selection, identify needed assumptions, and conjecture novel models if the domain
theory is incomplete.

10.2.2.2 Scientific discovery 

When viewed as research in machine discovery, this work represents a significant deviation
from traditional research (Langley, 1981; Langley et al., 1987; Falkenhainer, 1985;
Falkenhainer & Michalski, 1986; Rose & Langley, 1986; Zytkow & Simon, 1986), which has
focused on the formation of empirical laws characterizing a set of observations. These
equations, whether numeric or symbolic, characterize the behavior of a situation rather
than explain it. For example, BACON (Langley, 1981) and ABACUS (Falkenhainer, 1985)
could "discover" the ideal gas law (PV=nRT) but could not explain it. Likewise, STAHL
(Zytkow & Simon, 1986; Rose & Langley, 1986) could not explain how coal is composed of
"matter of fire" and ash. One of the drawbacks with these systems is their complete lack
of knowledge about the domain, the data variables being manipulated, and prior reasoning
experiences. They fail to capture the use of models in scientific investigation, which serve
to drive prediction and suggest relevant questions, leading to further investigation.

While ABACUS was an investigation of the knowledge-weak end of the spectrum, PHINEAS
is an investigation of the opposite, knowledge-intensive end. In being theory-driven, it is
more in the spirit of recent work by Rajamoney (1988a, 1988b). He examines the problem
of revising existing theories in light of new, anomalous observations, using explanation-based
and directed-experimentation techniques to prune the space of revision hypotheses.
These methods are powerful for identifying consistent revisions, but prune candidates
rather than reduce the search space. The analogical approach to theory revision proposed
in Chapter 7 and the combined use of analogy and experimentation presented in
(Falkenhainer & Rajamoney, 1988) are intended to alleviate this problem.

*Here, I do not distinguish between Forbus' ATMI program and DeCoste's DATMI program, which each
share the features relevant to this discussion.

10.2.2.3 Thagard and Holyoak's PI

The type of theory formation through analogy described in this thesis is an instance of what
Thagard (1987) calls analogical abduction. While his PI system (Thagard & Holyoak, 1985;
Thagard, 1987) addresses the same type of task, our methods are very different. First, PI
reasons using rule and schema representations that are substantially less sophisticated than
those used for PHINEAS. For example, a rule that waves "reflect" is used to propose that
sound might "reflect" because it is a wave (i.e., Wave(x) → Reflect(x)). Second, the
need for potential refinement is not recognized. Furthermore, these schema-based models
are not runnable and thus cannot generate predictions at the level needed for empirical
confirmation.

10.2.2.4 The Yale SWALE project 

SWALE (Kass, 1986; Leake & Owens, 1986; Kass et al., 1986) is a system under development
at Yale which uses stored explanation patterns to construct explanations of novel events,
such as the death of a famous race horse. It is able to adapt prior explanations to novel
situations by tweaking the schema to fit the specifics of a new case. In this manner, it
has many similarities to Kedar-Cabelli's PER model, since both attempt to deductively
rederive portions of a prior explanation that do not apply in the new situation. SWALE's
replay process is able to achieve a fair degree of sophistication by drawing from a library
of tweaking strategies which suggest ways to repair inapplicable portions of a recalled
explanation. SWALE does not attempt to address the complete correspondence problem,
since object level mappings are unambiguous in the small stories analyzed. Additionally,
SWALE is designed as more of a deductive rather than analogical system. For example, it
does not address cross-domain analogy issues, such as semantic similarity and forming new
relations or objects.

10.2.2.5 Langley and Jones



Around the same time that PHINEAS was first published (Falkenhainer, 1986), Langley &
Jones (1986) proposed an architecture sharing its "analogy through behavioral similarity"
component. Their proposal was directed at capturing the process of scientific insight (e.g.,
Dreistadt, 1968), in which recognition of a potential analogue is a temporally-extended process
consisting of a gradually increasing familiarity with the target concept culminating in
sudden recognition. Modeling this aspect of analogy is important and their work represents
the only AI research in this area that I know of.

10.2.2.6 Shrager's IE 

Shrager's (1987) IE is a system designed to formulate a theory, without instruction, about
a device to enable successful operation of the device. The system's primary theory development
mechanism is view application, which incrementally combines abstract schemas drawn
from a variety of domains to formulate and reformulate developing theories. It bears strong
resemblance to analogy and shares many of the abstract theory development goals found in
PHINEAS. They may be compared on a number of points. First, although Shrager indicates
that view application maps from an abstraction onto an instance, PHINEAS makes no commitment
as to the content of the base information and typically draws from prototypical
or abstract scenarios. Second, application of a view may reformulate existing knowledge,
whereas PHINEAS' mapping mechanism is strictly additive. On the other hand, VBAL
recognizes that an analogy (or applied view) may be "almost right" and need slight adjustment,
whereas IE makes repairs only via additional views (analogies). Third, PHINEAS
embodies a richer notion of predicate similarity, providing more autonomous access, and a
mapping and transfer phase better able to cope with differences in vocabulary.

I believe that view application corresponds to analogy mapping from abstractions, with
its coercion operation corresponding to PHINEAS's transfer phase. However, since the
operations of PHINEAS are not a superset of (nor a subset of) view application, additional
research is needed to combine important features of both.

10.3 The Future 

This section describes some of the fundamental open research problems encountered during
the development of PHINEAS. In addition, a few more realistic short-term extensions and
applications of the current work are described.

10.3.1 Analogy

Research on computational analogy is still in its infancy. Below are a number of open 
problems that seem particularly important for future progress in the field. 

10.3.1.1 The access problem

The problem of retrieving a plausibly useful analogue from memory still stands as the
least-understood, most important unsolved problem in analogy. To date, most research
has been able to avoid the problem, either through tutorial settings in which valuable hints
are provided or by restricting work to specialized forms of within-domain analogy. The
two-stage mechanism described in this thesis (i.e., use abstractions to focus on a candidate
set, use structural comparison to prune and order this set) still side-steps important issues.
How are these abstractions formed for the stored situations? How are they recognized in
the target situation? How are they organized so that an excessive number of analogues are
not retrieved?
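
The two-stage shape of this mechanism can be sketched as below. Everything in the
sketch is an assumption made for illustration: situations are dictionaries holding a set of
abstraction labels and a set of relation names, and "structural comparison" is reduced to
counting shared relation names, which is far cruder than the structural matching the
thesis describes. The sketch also takes the abstractions as given, which is exactly the part
the open questions above are about.

```python
def retrieve(target, memory):
    """Two-stage access: filter by shared abstraction, then order by overlap."""
    # Stage 1: keep stored situations that share at least one abstraction.
    candidates = [s for s in memory if s["abstractions"] & target["abstractions"]]

    # Stage 2: prune and order by a crude stand-in for structural comparison.
    def overlap(situation):
        return len(situation["relations"] & target["relations"])

    ranked = sorted(candidates, key=overlap, reverse=True)
    return [s["name"] for s in ranked if overlap(s) > 0]


# Hypothetical memory and target, loosely modeled on the disappearing alcohol example.
target = {
    "name": "disappearing-alcohol",
    "abstractions": {"decreasing-amount"},
    "relations": {"contains", "exposed-to", "decreasing"},
}
memory = [
    {"name": "dissolving", "abstractions": {"decreasing-amount"},
     "relations": {"contains", "decreasing", "immersed-in"}},
    {"name": "boiling", "abstractions": {"decreasing-amount", "heating"},
     "relations": {"contains", "exposed-to", "decreasing", "heat-source"}},
    {"name": "spring-oscillation", "abstractions": {"oscillation"},
     "relations": {"connected-to", "decreasing"}},
]
ranked_names = retrieve(target, memory)
```

Here the oscillation scenario is dropped in the first stage even though it shares a relation
name with the target, and the remaining two analogues are ordered by overlap.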

10.3.1.2 The consistency - coherency problem 

In Chapter 2 I claimed that two-valued consistency is overly restrictive as a basis for
analogical processing. However, no real solution is offered. PHINEAS' task is to do almost
whatever it takes to construct an explanation for an observed situation. It has no "I have
no idea" mechanism, unless no candidate analogue can be found to initiate the explanation
process or no hypothesis can be formed that is consistent with the observation. This is
part of the intended design. The best explanation available that is consistent with the
observation will be offered, no matter what the "cost". Determining if that "cost" is too
great for the explanation to be accepted is considered the province of final acceptance and
storage, which are postprocesses external to PHINEAS. An evaluative measure which takes
into account the cost of overthrowing prior beliefs for the benefits of a more coherent belief
state is needed. Additional factors such as plausibility and specificity in accounting for the
phenomenon are required as well.*

*Steps in this direction include HYPO (Ashley & Rissland, 1987) and ECHO (Thagard, 1988).

10.3.1.3 The semantic similarity problem

The definition of functionally analogous given in Chapter 2 solved a simplified version
of what appears to be a fundamental problem in analogy research - that the similarity
of two items depends on their respective roles in the representations being manipulated.
Indeed, the entire meaning of these representations may be context-sensitive. Recently,
work on situated meaning (Agre & Chapman, 1987; Suchman, 1987) has begun to explore
these ideas.

10.3.1.4 The compiled knowledge problem 

As discussed in Chapter 2, AI systems tend to use compiled knowledge, in which intermediate
reasoning steps are absent to promote efficiency of use. This runs counter to the need
in analogy to understand the reasons behind why the base domain description is organized
the way it is, so that it may be adapted for use in a new target situation. The amount of
knowledge required to be a proficient analogizer is greater than the amount of knowledge
required to simply perform useful inferences in the original base domain. In PHINEAS, this
was addressed by adding a cache slot to theories, indicating past reasoning not explicit
in the theory's description. Specifically, the cache slot links necessary prerequisites of a
process' effects relations to the antecedents that satisfy those prerequisites. However, this
is a small fix to a much larger problem. Specifically, a fundamental component of analogy
is knowing why the relations being considered for mapping are there. Knowing why prior
decisions were made the way they were is important to being able to satisfy the intent of
the decision without necessarily adhering to the same decision.* Work is needed to specify
what knowledge is needed, what knowledge should be cached, how that knowledge is to be
retrieved, and how that knowledge is to be used in the analogy process.

10.3.1.5 The one-to-one constraint

In the specification of contextual structure-mapping, I adopted Gentner's one-to-one mapping
constraint. It is clearly important, in that its complete removal results in nonsense
mappings.** However, there are cases where relaxing the one-to-one constraint would be
very useful. For example, consider an example adapted from (Kedar-Cabelli, 1988), in
which a conical styrofoam cup (scup) is compared to a metal cup (mcup) with a handle,
both for the purpose of containing a hot liquid. In the case of the metal cup, a single
structural characteristic (i.e., Has-Handle) satisfies two functional roles (i.e., Insulated
and Grasping-Area):

Function          Styrofoam cup       Metal cup

Insulated:        (Styrofoam scup)    (Has-Handle mcup)
Grasping-Area:    (Conical scup)      (Has-Handle mcup)

Thus, a many-to-one mapping is necessary to process this rather obvious analogy. This
is a prevalent phenomenon in mechanical design, in which a common goal is to minimize
the number of parts by maximizing the multiple functions each part provides. Research is
needed to identify when the important constraint provided by the one-to-one requirement
should be violated.

*A similar point has been made by Carbonell (1983a) in the context of planning by analogy.

**An experiment was tried using SME with no one-to-one constraints installed. At all times, this results in
a single interpretation which is effectively the union of all possible interpretations SME would have generated
were one-to-one being enforced.
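
To make the conflict concrete, the cup correspondences can be run through a simple
one-to-one check. The function below is a hypothetical helper written for this illustration,
not part of SME or PHINEAS; it only reports which items participate in more than one
pairing.

```python
from collections import Counter


def one_to_one_violations(correspondences):
    """Return the base and target items that appear in more than one pairing."""
    base_counts = Counter(base for base, _ in correspondences)
    target_counts = Counter(target for _, target in correspondences)
    return {
        "base": sorted(b for b, n in base_counts.items() if n > 1),
        "target": sorted(t for t, n in target_counts.items() if n > 1),
    }


# Pairings needed for the cup analogy: two styrofoam-cup facts map to one metal-cup fact.
cup_mapping = [
    ("(Styrofoam scup)", "(Has-Handle mcup)"),  # both fill the Insulated role
    ("(Conical scup)", "(Has-Handle mcup)"),    # both fill the Grasping-Area role
]
violations = one_to_one_violations(cup_mapping)
```

The check flags (Has-Handle mcup) as used twice, so a matcher enforcing one-to-one
strictly would have to drop one of the two functional correspondences.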

10.3.2 Abduction and default reasoning 

What is assumable? Work on abduction tends to side-step this issue in several ways. First,
probabilistic models (e.g., Szolovits, 1982; Buchanan & Shortliffe, 1984) have the luxury of
a priori probabilities assigned to antecedent information, allowing the use of probabilistic
decision analysis. When addressing open-ended, common-sense problems about the world,
having such probabilities seems unrealistic. Second, most work in abduction has been applied
to diagnostic problems in which the set of possible diseases or malfunctions is explicitly
enumerated (e.g., Buchanan & Shortliffe, 1984; Josephson et al., 1987). An enumerated set
of all assumables for the domain of general commonsense physics could conceivably consist
of the entire body of knowledge about the domain. What to assume seems dependent on
the particular situation under consideration. Finally, work in interpretation and story understanding
(e.g., Charniak, 1988; Mooney, 1987) tends to use schema-based models and
identify the assumables as the unknown elements of a relevant schema. The approach seems
to work in practice, but the reasons for this have yet to be studied. Most likely, schema
representations implicitly package important interrelations between the constituent facts,
producing intuitively appealing performance.

In PHINEAS, Charniak's (1988) approach of assume if consistent was adopted. While
all nonsense assumptions (according to my own personal evaluation) PHINEAS ever made
were attributable to gaps in the domain theory, I'm still hesitant to consider it an actual
solution. Some form of likelihood estimate seems necessary, though actual probabilities
provide their own set of problems.

Why does my sliding garage door oscillate up and down when I open it quickly?

Figure 10.1: Sliding garage door in open, equilibrium position.

For example, two alternate answers were possible in the osmosis example described
in the previous chapter. These alternatives center around the belief that the membrane
separating the two containers is a Volume-Solid (i.e., solid all the way through, non-porous,
no holes, etc.), in which case the membrane could not be a fluid path:

(Volume-Solid ?obj) → (not (Fluid-Path ?obj))

In the previous chapter, PHINEAS was shown to propose that the membrane was a Fluid-Path
by defeating its default assumption

(Solid ?obj) → Assume(Volume-Solid ?obj)

However, if the membrane was firmly believed to be a Volume-Solid, then PHINEAS would
have proposed the existence of an unknown fluid path, (FLUID-PATH SK-PIPE-3), rather
than identify the membrane as the suspected path. Whether or not a third alternative,
that the membrane is a new kind of Fluid-Path, should be one of PHINEAS' options is
uncertain. It might lead to a combinatoric explosion in assumptions.
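
The choice between the two osmosis hypotheses can be summarized in a few lines. This is
only a schematic restatement of the behavior described above, under the assumption that
each belief is tagged as either firmly held or assumed by default; the function name and
the tagging scheme are hypothetical and do not reflect PHINEAS' internal representation.

```python
def propose_fluid_path(membrane_beliefs):
    """Choose a fluid-path hypothesis from beliefs tagged "firm" or "default"."""
    status = membrane_beliefs.get("Volume-Solid")
    if status == "firm":
        # A firmly believed Volume-Solid cannot be a fluid path,
        # so an unknown path (a skolem object) is posited instead.
        return ("Fluid-Path", "SK-PIPE-3")
    # A missing or merely default belief can be defeated:
    # the membrane itself is proposed as the path.
    return ("Fluid-Path", "membrane")


default_case = propose_fluid_path({"Solid": "firm", "Volume-Solid": "default"})
firm_case = propose_fluid_path({"Solid": "firm", "Volume-Solid": "firm"})
```

The third alternative mentioned above, treating the membrane as a new kind of
Fluid-Path, is deliberately left out of the sketch.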

10.3.3 Integrating and refining multiple models 

Figure 10.1 shows a sliding garage door in its open, equilibrium position. Told that the
door is on a track, the task is to explain its operation and why it oscillates when opened
(importantly, the observer does not know about the spring shown in the figure). Explaining
the entirety of its behavior was one of the intended, but unrealized, goals for PHINEAS. It
is a highly desirable example for an analogical explanation system like PHINEAS. First, it
requires more sophisticated reasoning than previously described examples. Second, it requires
the integration of multiple models about the door's different aspects (e.g., the action
of both gravity and the hypothesized spring). Finally, it requires substantial adaptation of
the relevant models.

Consider the intricacies of the door's behavior. Simply applying a spring-block oscillator
model to this system would not produce accurate enough predictions. First, as the door
nears its closed position, a point is reached where the closing force of gravity dominates the
opening force of the spring. This is because the door's overhang is a function of position.
Second, consider what happens as the door oscillates. Moving further up the track moves
the door's anchoring line to the left, thus stretching the spring. Moving further down the
track pulls the door's anchoring line down (or to the right), thus stretching the spring as
well. This configuration produces standard spring-block oscillation, yet does so only by
stretching the spring, never compressing it. Using analogy to understand how this garage
door operates requires some minimal knowledge of geometry and the ability to refine an
initial stretch-compress model into a stretch-stretch model. Even so, it is clear that there
is a strong analogy here, showing the utility of analogy for providing an initial model from
which to work. Developing a system that can explain the garage-door scenario will require
further research in analogy using multiple models (e.g., extending work begun by Burstein
(1983)), complex qualitative reasoning and modeling (e.g., Falkenhainer & Forbus, 1988),
and spatial reasoning (e.g., Forbus et al., 1987; Joskowicz, 1987; Nielsen, 1988).

10.3.4 Generalizing the model 

At the start of this chapter, I expressed the belief that the model PHINEAS embodies has
broad utility. This remains to be shown. Adapting the ideas to applications in automated
design and planning seems a natural first step in testing their breadth and utility. Interpretation,
design, and planning tasks are of a similar nature: identify a sequence of events
that transform some initial state into a given final state (Simmons & Davis, 1987).

There are two prerequisites to constructing the next generation PHINEAS. First, knowledge
which is procedurally encoded in the program must be made declarative before task
independence can be achieved. This includes the access belief of

(Correlated-to cause behavior)

as well as the same correlations assumed during the generation of candidate inferences.
Some have already begun to address this problem (Baker et al., 1988; Clark, 1988; Davies
& Russell, 1987; Russell, 1987). Second, the adequacy definition and its corresponding evaluative
method must be generalized to include any problem solving activity. For example, a
proposed design must actually produce the specified behavior. Note the similarity between
this requirement and the adequacy requirement currently used in PHINEAS.

10.3.5 Scientific Discovery

Undoubtedly, a lot of information has been implicitly and unintentionally built into PHINEAS.
How else could we explain the relative ease with which it conjectured a caloric, flowlike
model of the thermal phenomena given it? For example, PHINEAS lacks the experiential and
theoretical breadth and depth that makes science a tedious, complex, and often ambiguous
process. Furthermore, the data given PHINEAS is predominantly relevant and distilled. Per-
haps the initial proposal of the caloric theory was actually this simple once all the relevant
observational information was available (e.g., the fact that the temperatures of a hot and
cold object placed in contact actually become equal was not discovered until the invention
of the thermometer (Roller, 1961; Wiser & Carey, 1983)). Unfortunately, we probably will
never know. Accurately recreating the knowledge (or lack thereof) existing immediately prior
to the development of the caloric theory would be extremely difficult if not impossible.

This problem is common to all existing work in computational scientific discovery. Fa-
mous theories, taking months, years, or centuries to develop are somehow "rediscovered"
in a matter of minutes or hours. Despite this obvious problem, much has been and will
continue to be learned from such endeavors. However, soon we must attempt "the real
thing" - apply these systems to open research problems, or at the very least, problems that
may be considered unknown to the program developer. While the field is still too much in
its infancy to be making such attempts, the time to tackle real problems is near.

10.3.6 Integrated experiential learning

PHINEAS and all other analogy systems built to date* use analogy as their sole learning
method. However, analogy, like any other single learning mechanism, is best viewed as a 
single component in a synergistic cooperation of learning methods. For example, in sci- 
entific investigation, an analogically derived hypothesis may suddenly "come to mind". 
However, this flash of insight may have been preceded by a tedious, incremental process in 
which data was collected and analyzed, patterns sought, and overall familiarity increased 
(Dreistadt, 1968; Langley & Jones, 1988). In order to build a general investigative system,
we must integrate analogy with directed experimentation, empirical learning, and analytic 
learning. Some work on developing a general protocol enabling such interaction has al-
ready begun (Falkenhainer & Rajamoney, 1988). However, the protocol does not answer
the question of how an analogy system may take advantage of the work prior problem solv-
ing and trend detection provides. Additionally, it says nothing about how the results of
analogy should be integrated into memory. At what point does the generalization process
often associated with analogy take place?

*An exception is the EBL system developed by Winston et al. (1983), which used Winston's general
ANALOGY program to suggest explanations for explanation-based generalization.

10.3.7 Lifespan 

I will close this thesis by considering a somewhat loftier, long-term research problem. Even-
tually, in building learning systems that are able to create new concepts and overthrow
existing belief structures (i.e., learn at the knowledge-level (Dietterich, 1986)), we will have 
to address the problem of how knowledge evolves. True knowledge-level learning over an 
extended period of time is non-monotonic. This leads to the following problem: 

• The insecurity problem: A learning system operating in a real-world environ- 
ment and capable of augmenting, restructuring, and doubting its knowledge will 
be faced with the dilemma of deciding between doubting the validity of what 
it encounters, doubting the validity of its hypothesis about the encounter, or
doubting the correctness of what it believes. 

If a new experience leads to an explanatory hypothesis that contradicts existing beliefs, 
should the system question the observation, its hypothesis, or its prior knowledge? The inse-
curity problem also encompasses the experiential consistency problem (Rajamoney, 1988b):
a new hypothesis must be consistent with the entirety of a system's past experiences. This 
is an untenable requirement in its absolute form. Yet, once exceptions to beliefs are allowed, 
we are once again faced with the problem of knowing where and when to doubt. 

This is a particularly acute problem in an analogical learning system like PHINEAS, 
which is capable of proposing hypotheses about an observation that are consistent with the 
observation, but conflict with existing beliefs. PHINEAS has been restricted to emulate an
essentially monotonic learning system. The only beliefs PHINEAS is allowed to overthrow
are closed world assumptions about the breadth of its knowledge. If an existing belief is
contradicted, a new concept is generated rather than attempting to overthrow that existing
belief. That restriction effectively avoids the problem. For example, when considering a
slight modification of an existing concept, should the old concept be generalized, modified
by addition of disjuncts, or a new concept created? In PHINEAS, a new concept is proposed
along with detailed information about how the concept was created. The storage issue
concerned with what to do with the proposed concept is viewed as external to PHINEAS.

The comfort of maintaining a constant deductive closure is very seductive. As soon as
you step beyond the deductive closure, you effectively enter the twilight zone. The reasoning
agent can actually question the validity of its own beliefs. Insecurity sets in. This is part of
the current gap between machine learning systems and what might be called an intelligent,
learning agent. However, this gap is what I believe makes the research interesting.

Appendix A

Note on Abductive Backchaining 



As described in Chapter 5, the abductive retrieval task is to find consistent instantiations 
for a given well-formed formula p. If 

$$p = C_1 \wedge C_2 \wedge \ldots \wedge C_N$$

and only a subset of its elements may be shown, the remaining conjuncts are assumed, 
contingent on their joint consistency. If p contains free variables, this task requires consid- 
eration of the N-M binding problem. For example, given the goal to show 

$$P(?x) \wedge Q(?x) \wedge R(?x)$$

with $P(a)$, $Q(b)$, and $R(b)$ believed true, the abductive retriever should return two possi-
bilities:

    Binding       Assumptions
 1: (?x . a)      $Q(a) \wedge R(a)$
 2: (?x . b)      $P(b)$

Solving each subgoal sequentially will prevent the second possibility, because the values for
?x after solving for P(?x) are limited to {a}.
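
The difference between the two strategies can be illustrated with a minimal Python sketch. The belief table and the function names `sequential`, `abductive`, and `assumptions_for` are assumptions made for this illustration, and the joint-consistency check on the assumed conjuncts is omitted; this is not the retriever used in PHINEAS.

```python
from itertools import chain

# Hypothetical belief base: predicate name -> constants it is believed true of.
BELIEFS = {"P": {"a"}, "Q": {"b"}, "R": {"b"}}
GOAL = ["P", "Q", "R"]  # stands for P(?x) AND Q(?x) AND R(?x)


def assumptions_for(value, goal, beliefs):
    """Conjuncts not believed for this value, which would have to be assumed."""
    return [f"{pred}({value})" for pred in goal if value not in beliefs[pred]]


def sequential(goal, beliefs):
    """Solve the first conjunct, then carry only its bindings to the rest."""
    candidates = beliefs[goal[0]]
    return [(v, assumptions_for(v, goal, beliefs)) for v in sorted(candidates)]


def abductive(goal, beliefs):
    """Let every conjunct suggest bindings before any binding is committed to."""
    candidates = set(chain.from_iterable(beliefs[pred] for pred in goal))
    return [(v, assumptions_for(v, goal, beliefs)) for v in sorted(candidates)]


print(sequential(GOAL, BELIEFS))  # only the binding ?x = a is considered
print(abductive(GOAL, BELIEFS))   # both ?x = a and ?x = b are considered
```

In this toy setting the sequential strategy never considers ?x = b, because the first conjunct restricts the candidates to {a} before Q and R are consulted.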

This appendix begins with an overview of the abductive retriever used in PHINEAS. It 
then presents the method by which conjunctive goals are solved. Throughout the discussion, 
it is assumed that knowledge is expressed as Horn clauses.

Figure A.1: Monitors manage communication between subgoals and parent goal. (1) Receive
a response from a subgoal. (2) Have all the subgoals responded? (3) If so, generate
the current set of possible answers. (4) Return those answers to the parent goal.



A.1 Monitors

Each goal is managed by a monitor, which coordinates efforts to solve the goal and sends
responses to its parent goal each time a new solution is found (Figure A.1). Monitors are of
three types, corresponding to the goal managed: atomic, conjunctive, and backchain. Mon-
itor execution is controlled by an agenda, which is an ordered list of monitors corresponding
to the leaves of the search tree.¹

¹Monitors are placed in order of increasing depth, producing breadth-first search. A more sophisticated
strategy would take into consideration the plausibility of assumptions, complexity and specificity of the
explanation, and the current success of alternate explanations.

Responses from a subgoal to its parent goal consist of (answers, response-type) pairs.
There are three response types. A complete response indicates that no further answers are
possible beyond the set of answers currently being returned. A partial response returns
an existing set of answers and indicates that additional responses may be forthcoming. For
example, a goal to show an atomic formula will both perform a fetch for existing instances,
which are returned through a partial response, and spawn a backward chaining subgoal
to seek additional possibilities. Finally, a fail response indicates that no answers have
been found and will never be found. For example, a backchaining monitor will immediately
send a fail response if there are no applicable rules. A monitor responding with complete
or fail is considered closed. For example, an atomic goal will close with a response of
complete if its initial fetch for existing instances is successful and its backchaining subgoal
fails.
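
The response protocol can be summarized in a short Python sketch. The names `ResponseType`, `is_closed`, and `atomic_goal_response` are hypothetical, and the function reports an atomic goal's cumulative state rather than the incremental messages a real monitor would send, so it should be read as an illustration under those assumptions only.

```python
from enum import Enum


class ResponseType(Enum):
    COMPLETE = "complete"  # no answers beyond those being returned
    PARTIAL = "partial"    # more answers may still arrive
    FAIL = "fail"          # no answers found, and none ever will be


def is_closed(response_type):
    """A monitor that has responded with complete or fail is closed."""
    return response_type in (ResponseType.COMPLETE, ResponseType.FAIL)


def atomic_goal_response(fetched, backchain_type, backchain_answers=()):
    """Combine the initial fetch with the backchaining subgoal's latest response.

    fetched:           instances found by the initial fetch
    backchain_type:    ResponseType most recently sent by the backchain subgoal
    backchain_answers: answers accumulated from that subgoal so far
    """
    answers = list(fetched) + list(backchain_answers)
    if is_closed(backchain_type):
        # Nothing more can arrive: close with complete if anything was found.
        closing = ResponseType.COMPLETE if answers else ResponseType.FAIL
        return answers, closing
    return answers, ResponseType.PARTIAL


# Successful fetch plus a failed backchaining subgoal closes with "complete".
print(atomic_goal_response(["P(a)"], ResponseType.FAIL))
```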

A.2 Conjunctive Goals

The retrieval task may be reformulated in terms of the free variables $\{v_1, v_2, \ldots, v_n\}$ con-
tained in $p$: find a substitution $\theta$ which associates each $v_i$ to a known entity such that
$p\theta$ is believed consistent in some ATMS context. In this view, each retrieved instantiation
$C_{ij}$ of conjunct $C_i$ supplies new possible bindings for the free variables it contains. Each
possible value for a free variable is implicitly represented as an ATMS node corresponding
to the assignment $?v_i = value_j$ and is supported by the instantiated nodes using that value
for $?v_i$.² For example, if the task is to retrieve

$$P(?x, ?y) \wedge Q(?y, ?z)$$

then retrieval of $P(3,4)$, $P(3,5)$, and $Q(4,7)$ produces the following justifications

    $P(3,4) \rightarrow ?x = 3$        $P(3,5) \rightarrow ?y = 5$
    $P(3,5) \rightarrow ?x = 3$        $Q(4,7) \rightarrow ?y = 4$
    $P(3,4) \rightarrow ?y = 4$        $Q(4,7) \rightarrow ?z = 7$

With this representation, the retrieval task is to find all consistent bindings containing 
exactly one selection from each variable's choice set: 

    $?v_i$: (($?v_i = value_j$)
             ...)
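
As an illustration of this representation, the following Python sketch builds the choice sets for the P(?x,?y) and Q(?y,?z) example and enumerates every selection of one value per variable. The function names and the data layout are assumptions made for the sketch, and the candidates it lists have not yet been checked for consistency against any ATMS context.

```python
from itertools import product


def choice_sets(retrieved):
    """Map each free variable to the values suggested by retrieved instances.

    retrieved: list of (variables, instances) pairs, where variables names the
               argument positions of a conjunct and instances lists the
               argument tuples of its retrieved instantiations.
    """
    choices = {}
    for variables, instances in retrieved:
        for args in instances:
            for var, value in zip(variables, args):
                choices.setdefault(var, set()).add(value)
    return choices


def candidate_bindings(choices):
    """Every selection of exactly one value from each variable's choice set."""
    names = sorted(choices)
    value_lists = [sorted(choices[name]) for name in names]
    return [dict(zip(names, combo)) for combo in product(*value_lists)]


retrieved = [
    (("?x", "?y"), [(3, 4), (3, 5)]),  # P(3,4) and P(3,5)
    (("?y", "?z"), [(4, 7)]),          # Q(4,7)
]
sets = choice_sets(retrieved)
for binding in candidate_bindings(sets):
    print(binding)
```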



²This is an implicit ATMS node in that the node is never actually created. Rather, its label (all minimal
sets of assumptions under which the node is believed) is computed and stored separately by the abductive
retriever. This is a significant efficiency savings and eliminates the possibility of the ATMS database becoming
universally contradictory. This can happen when two alternate values for $?v_i$ have absolute support.

There are two senses of consistency relevant to this operation. First, unification con-
sistency of the bindings must be maintained. This is assured by the unification procedure
used to retrieve the instantiated expressions and the criterion that only one selection from
each variable's choice set is allowed. Second, each instantiation for the conjunction $p$ must
be consistent with existing knowledge.

The algorithm for solving a conjunctive goal is shown in Table A.1. The task begins
by activating a subgoal for each conjunct $C_i$, followed by a period of waiting for an initial
response from each subgoal. As subgoals respond, the stored set of possible instantiations
for $p$'s free variables is updated. Once each subgoal has responded, the conjunctive goal's
monitor computes all possible substitutions $\theta_i$ from the existing choice sets. Only those
substitutions supported by an existing ATMS context are retained. This is determined by
computing the conjunction of the individual binding nodes ($?v_i = value_j$) contained in a
given $\theta_i$ and verifying that this conjunction is believed in some context.³ As stated, this
has the possibility to form an instantiation for $p$ in which none of the conjuncts are believed.
For example, given the goal

$$P(?x, ?y) \wedge Q(?x, ?y)$$

finding $P(a,b)$ supports assumption of $Q(a,b)$ and finding $Q(c,d)$ supports assumption
of $P(c,d)$. Together, these candidates suggest the instantiation $P(a,d) \wedge Q(a,d)$. Such
unsupported candidate instantiations are rejected.
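
The rejection of unsupported candidates can be sketched in Python as follows. The fact table and the helper names are hypothetical, and the ATMS context check is replaced here by the simpler requirement that at least one conjunct of the instantiated goal is an actual fact, which is the only condition this example needs.

```python
from itertools import product

# Hypothetical known facts and the goal P(?x,?y) AND Q(?x,?y).
FACTS = {("P", "a", "b"), ("Q", "c", "d")}
GOAL = [("P", "?x", "?y"), ("Q", "?x", "?y")]


def instantiate(conjunct, theta):
    """Apply the substitution theta to one conjunct."""
    pred, *variables = conjunct
    return (pred, *(theta[v] for v in variables))


def supported_candidates(goal, facts):
    # Build choice sets from every fact that matches some conjunct.
    choices = {}
    for pred, *variables in goal:
        for fact in facts:
            if fact[0] == pred and len(fact) == len(variables) + 1:
                for var, value in zip(variables, fact[1:]):
                    choices.setdefault(var, set()).add(value)
    names = sorted(choices)
    kept = []
    for combo in product(*(sorted(choices[n]) for n in names)):
        theta = dict(zip(names, combo))
        believed = [c for c in goal if instantiate(c, theta) in facts]
        if believed:  # drop instantiations in which no conjunct is believed
            kept.append(theta)
    return kept


print(supported_candidates(GOAL, FACTS))
```

Of the four combinations in the cross product, the mixed candidates (a,d) and (c,b) are dropped because neither P nor Q holds for them.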

Note that lack of an ATMS context in which (Consistent $p\theta$) is believed does not imply
$p\theta$ is inherently inconsistent. It simply means that existing assumptions supporting the
proposed instantiation of $p$ are mutually inconsistent. Alternate, consistent assumptions
supporting the same instantiation could arise in the future. However, assuming $p\theta$ in the
hope of future consistency is deemed too weak as a basis for making abductive assumptions.



³The ATMS label for a node ($?v_i = value_j$) indicates all minimal sets of assumptions under which the node
is believed. The existence of a context supporting a given binding set is verified by computing the label
for the conjunction of all nodes ($?v_i = value_j$) supporting the binding. If the label is non-empty, there is
at least one context in which $p\theta$ is believed.

Table A.1: Abductive retrieval for conjunctive goals.

1. Spawn an atomic subgoal on each conjunct $\{C_1, \ldots, C_n\}$

2. Wait for initial responses

   Repeat until all conjuncts have responded:

   (a) Receive response from $C_i$

   (b) If response type = fail and
       $C_i$ is closable via closed world assumption or $C_i$ is the only conjunct containing some variable,
       return fail.

   (c) Store suggested bindings from $C_i$

3. Generate bindings cross product

   (a) For each candidate binding $\theta$, determine if $p\theta$ is supported by an existing ATMS con-
       text (i.e., its label is non-empty). Unsupported bindings are rejected due to their
       inconsistency.

   (b) For each accepted binding $\theta$, assume unknown conjuncts in $p\theta$.

   (c) Respond

       If there are binding sets and every $C_i$ task is closed
          Respond(binding-sets, complete)

       If there are binding sets and there is a $C_i$ task that is not closed
          Respond(binding-sets, partial)

       If there are no binding sets and every $C_i$ task is closed
          Respond(NIL, fail)

4. Extend existing possibilities

   Repeat until every $C_i$ task is closed:

   (a) Receive response from $C_i$

   (b) Store suggested bindings from $C_i$

   (c) Extend existing binding cross product

       If there are extensions and every $C_i$ task is closed
          Respond(binding-sets, complete)

       If there are extensions and there is a $C_i$ task that is not closed
          Respond(binding-sets, partial)

       If there are no extensions and every $C_i$ task is closed
          Respond(NIL, complete)

Appendix B

Contextual Structure Mapping Rule Set

Contextual structure mapping is modeled by the CSM rule set given to SME, which defines
SME_CSM. This single rule set was used without change in every PHINEAS example described
in this thesis. Chapter 3 described these rules and gave the motivations behind each. Here,
the complete set of Lisp rules used in SME_CSM is listed. With each rule is an indication of
the predicate calculus rule(s) from Chapter 3 it implements. The name and numbers for
each rule refer to the rule's description in Chapter 3.

B.1 Step 1: Local Match Construction

Given two dgroups, SME begins by finding potential matches between items in the base and
target. Allowable matches are specified by match constructor rules, which take the form:

(MHC-rule (⟨Trigger⟩ ⟨BaseVariable⟩ ⟨TargetVariable⟩ [:test ⟨TestForm⟩])
    ⟨Body⟩)

In all match constructor rules, ⟨Body⟩ is executed in an environment in which ⟨BaseVariable⟩
and ⟨TargetVariable⟩ are bound to items from the base and target dgroups, respectively.
If ⟨TestForm⟩ is present, the bindings must satisfy the test (i.e., the form when evaluated
must return non-NIL). There are two possible values for ⟨Trigger⟩. A :filter trigger
indicates that the rule is applied to each pair of items from the base and target. These rules
create an initial set of match hypotheses between individual base and target expressions.
An :intern trigger indicates that the rule should be applied to each newly created match
hypothesis, binding the variables to its base and target items. These rules create additional
matches suggested by the given match hypothesis.
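
The two trigger types can be illustrated with a small Python sketch. Everything in it is an assumption made for illustration: expressions are modeled as tuples and entities as strings, and the two example rules are much simpler than the CSM rules listed in this appendix. It shows only the control flow, in which filter-style rules see every base/target pair and intern-style rules see each match hypothesis as it is created.

```python
def is_expr(item):
    """Expressions are tuples (functor, arg, ...); entities are plain strings."""
    return isinstance(item, tuple)


def same_functor(b, t, install):
    # Filter-style rule: pair expressions whose functors are identical.
    if is_expr(b) and is_expr(t) and b[0] == t[0]:
        install(b, t)


def corresponding_args(b, t, install):
    # Intern-style rule: a match between two expressions suggests matching
    # their arguments position by position.
    if is_expr(b) and is_expr(t):
        for b_arg, t_arg in zip(b[1:], t[1:]):
            install(b_arg, t_arg)


def build_match_hypotheses(base, target, filter_rules, intern_rules):
    hypotheses = []

    def install(b, t):
        if (b, t) not in hypotheses:
            hypotheses.append((b, t))

    for b in base:                   # filter rules see every base/target pair
        for t in target:
            for rule in filter_rules:
                rule(b, t, install)
    index = 0
    while index < len(hypotheses):   # intern rules see each new hypothesis
        b, t = hypotheses[index]
        for rule in intern_rules:
            rule(b, t, install)
        index += 1
    return hypotheses


base = [("flow", "water", "pipe")]
target = [("flow", "heat", "bar")]
matches = build_match_hypotheses(base, target, [same_functor], [corresponding_args])
for match in matches:
    print(match)
```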

;;; Rule 1 (Same Functors)

(MHC-rule (:filter ?b ?t :test (and (eq (expression-functor ?b) (expression-functor ?t))
                                    (not (paired-item? :base-item (expression-functor ?b)))
                                    (not (paired-item? :target-item (expression-functor ?t)))))
    (install-MH ?b ?t))

;;; Rule 2 (Functionally Analogous)

(MHC-rule (:intern ?b ?t :test (and (implicational? ?b) (implicational? ?t)))
    (when (and (eq (expression-functor (implicational-consequent ?b))
                   (expression-functor (implicational-consequent ?t)))
               ;; Implying a logical connective doesn't count
               (not (logical? (expression-functor (implicational-consequent ?b))))
               ;; A logical connective implying something doesn't count
               (not (logical? (expression-functor (implicational-antecedent ?b)))))
       ;; If atomic consequents match, then match atomic antecedents
       (install-MH (implicational-antecedent ?b) (implicational-antecedent ?t))
       (sme:assert! `(function-of ,(implicational-antecedent ?b)
                                  (supports ,(implicational-consequent ?b))))
       (sme:assert! `(provides-function ,(implicational-antecedent ?t)
                                        (supports ,(implicational-consequent ?t))))))

;;; Rule 3 (Sanctioned Pairing)

(MHC-rule (:filter ?b ?t :test (sanctioned-pairing? (expression-functor ?b)
                                                    (expression-functor ?t)))
    (install-MH ?b ?t))

;;; Rules 4-6 (Non-Commutative Corresponding Arguments)

(MHC-rule (:intern ?b ?t :test (and (expression? ?b) (expression? ?t)
                                    (not (commutative? (expression-functor ?b)))
                                    (not (commutative? (expression-functor ?t)))))
    (do ((bchildren (expression-arguments ?b) (cdr bchildren))
         (tchildren (expression-arguments ?t) (cdr tchildren)))
        ((or (null bchildren) (null tchildren)))
      (cond ((and (entity? (first bchildren)) (entity? (first tchildren))
                  (or (sanctioned-pairing? (entity-name (first bchildren))
                                           (entity-name (first tchildren)))
                      (and (not (paired-item? :base-item
                                              (entity-name (first bchildren))))
                           (not (paired-item? :target-item
                                              (entity-name (first tchildren)))))))
             (install-MH (first bchildren) (first tchildren)))

            ((or (entity? (first bchildren)) (entity? (first tchildren))))   ;quit

            ((and (function? (expression-functor (first bchildren)))
                  (function? (expression-functor (first tchildren)))
                  (not (paired-item? :base-item (expression-functor (first bchildren))))
                  (not (paired-item? :target-item (expression-functor (first tchildren)))))
             (install-MH (first bchildren) (first tchildren)))

            ((and (not (paired-item? :base-item (expression-functor (first bchildren))))
                  (not (paired-item? :target-item (expression-functor (first tchildren))))
                  (not (function? (expression-functor (first bchildren))))
                  (not (function? (expression-functor (first tchildren))))
                  (predicate-type-intersection? (expression-functor (first bchildren))
                                                (expression-functor (first tchildren))))
             (install-MH (first bchildren) (first tchildren))))))

;;; Rules 7-9 (Commutative Corresponding Arguments)

(MHC-rule (:intern ?b ?t :test (and (expression? ?b) (expression? ?t)
                                    (commutative? (expression-functor ?b))
                                    (commutative? (expression-functor ?t))))
    (dolist (bchild (expression-arguments ?b))
      (dolist (tchild (expression-arguments ?t))
        (cond ((and (entity? bchild) (entity? tchild)
                    (or (sanctioned-pairing? (entity-name bchild)
                                             (entity-name tchild))
                        (not (paired-item? :base-item (entity-name bchild)))
                        (not (paired-item? :target-item (entity-name tchild)))))
               (install-MH bchild tchild))

              ((and (function? (expression-functor bchild))
                    (function? (expression-functor tchild))
                    (not (paired-item? :base-item (expression-functor bchild)))
                    (not (paired-item? :target-item (expression-functor tchild))))
               (install-MH bchild tchild))

              ((and (not (paired-item? :base-item (expression-functor bchild)))
                    (not (paired-item? :target-item (expression-functor tchild)))
                    (not (function? (expression-functor bchild)))
                    (not (function? (expression-functor tchild)))
                    (predicate-type-intersection? (expression-functor bchild)
                                                  (expression-functor tchild)))
               (install-MH bchild tchild))))))



B.2 Step 2: Global Match Construction 

Once an initial set of match hypotheses is formed, the pairwise consistency of match hy-
potheses stated by Conflicting($MH(b_i, t_j)$) is used to combine them into maximal, consis-
tent gmaps. Rules are used to define the contents of Conflicting for each match hypothesis
and are executed by the pattern-directed rule engine attached to the BMS. Their syntax is
slightly different from those described above. Rather than being assigned specific base and
target items, each trigger contains a pattern which must unify with a known BMS datum. 
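
A minimal Python sketch of how a Conflicting set might be derived from a one-to-one constraint is given below. It covers only the simplest case, in which two match hypotheses share a base item or a target item but not both; the predicate-level, temporal, and compound-object rules are not modeled, and the function names and data are hypothetical.

```python
def conflicting(hypotheses):
    """Map each match hypothesis (base, target) to those it cannot coexist with."""
    conflicts = {mh: set() for mh in hypotheses}
    for b1, t1 in hypotheses:
        for b2, t2 in hypotheses:
            # Exactly one side is shared: the same base item paired with
            # different targets, or the same target paired with different bases.
            if (b1 == b2) != (t1 == t2):
                conflicts[(b1, t1)].add((b2, t2))
    return conflicts


def is_consistent(selection, conflicts):
    """True if no two selected hypotheses conflict with each other."""
    return all(conflicts[mh].isdisjoint(selection) for mh in selection)


mhs = [("water", "heat"), ("water", "temperature"), ("pipe", "bar")]
table = conflicting(mhs)
print(table[("water", "heat")])
print(is_consistent({("water", "heat"), ("pipe", "bar")}, table))
```

A gmap would then correspond to a maximal selection of hypotheses for which such a consistency test succeeds.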

;;; Rules 10-13 (One-to-one constraints)

;;; one-to-one (expressions & entities) base case
(rule ((:intern (MH ?b ?t1))
       (:intern (MH ?b ?t2) :test (not (eq ?t1 ?t2))))
    (Conflicting (fetch-mh ?b ?t1) (fetch-mh ?b ?t2)))

;;; one-to-one (expressions & entities) target case
(rule ((:intern (MH ?b1 ?t))
       (:intern (MH ?b2 ?t) :test (not (eq ?b1 ?b2))))
    (Conflicting (fetch-mh ?b1 ?t) (fetch-mh ?b2 ?t)))

;;; one-to-one (predicates) base case
(rule ((:intern (MH ?b1 ?t1) :test (and (expression? ?b1) (expression? ?t1)))
       (:intern (MH ?b2 ?t2) :test (and (expression? ?b2) (expression? ?t2)
                                        (eq (expression-functor ?b1)
                                            (expression-functor ?b2))
                                        (not (eq (expression-functor ?t1)
                                                 (expression-functor ?t2))))))
    (Conflicting (fetch-mh ?b1 ?t1) (fetch-mh ?b2 ?t2)))

;;; one-to-one (predicates) target case
(rule ((:intern (MH ?b1 ?t1) :test (and (expression? ?b1) (expression? ?t1)))
       (:intern (MH ?b2 ?t2) :test (and (expression? ?b2) (expression? ?t2)
                                        (not (eq (expression-functor ?b1)
                                                 (expression-functor ?b2)))
                                        (eq (expression-functor ?t1)
                                            (expression-functor ?t2)))))
    (Conflicting (fetch-mh ?b1 ?t1) (fetch-mh ?b2 ?t2)))

;;; Rule 14 (Temporal preservation)

(rule ((:intern (MH ?b1 ?t1) :test (and (temporally-scoped? ?b1) (temporally-scoped? ?t1)))
       (:intern (MH ?b2 ?t2) :test (and (temporally-scoped? ?b2) (temporally-scoped? ?t2))))
    (unless (or (and (same-time? ?b1 ?b2) (same-time? ?t1 ?t2))             ;both same
                (and (different-time? ?b1 ?b2) (different-time? ?t1 ?t2)))  ;both different
      (Conflicting (fetch-mh ?b1 ?t1) (fetch-mh ?b2 ?t2))))

;;; Rule 15 (Compound object rearrangement: contained-liquid case)

(rule ((:intern (MH ?b1 ?t1) :test (and (contained-liquid? ?b1 :base-item)
                                        (contained-liquid? ?t1 :target-item)))
       (:intern (MH ?b2 ?t2) :test (or (and (container-of? ?b1 ?b2 :base-item)
                                            (not (container-of? ?t1 ?t2 :target-item)))
                                       (and (not (container-of? ?b1 ?b2 :target-item))
                                            (container-of? ?t1 ?t2 :base-item)))))
    (Conflicting (fetch-mh ?b1 ?t1) (fetch-mh ?b2 ?t2)))



B.3 Step 4: Compute evaluation scores

Once the gmaps have been formed and their candidate inferences determined, SME concludes
by assigning each gmap a match evaluation score. Managed by the BMS, these rules supply
evidence through the form Implies(⟨antecedent⟩, ⟨consequent⟩, ⟨weight⟩).

;;; Rule 16 (Minimal Ascension)
(initial-assertion (sme:assert! 'type-match))

(rule ((:intern (MH ?b ?t) :test (and (expression? ?b) (expression? ?t)
                                      (not (paired-item? :base-item (expression-functor ?b)))
                                      (not (paired-item? :target-item
                                                         (expression-functor ?t))))))
    (let ((pl (Mp-path-length (expression-functor ?b) (expression-functor ?t))))
      (when (and (numberp pl) (> pl 0))
        (sme:assert! `(implies type-match (MH ,?b ,?t) (,(/ 0.4 pl) . 0.0))))))

;;; Rule 17 (Sanctioned Pairing Evidence)

(initial-assertion (sme:assert! 'sanctioned-pairing))

(rule ((:intern (MH ?b ?t) :test (and (expression? ?b) (expression? ?t)
                                      (sanctioned-pairing? (expression-functor ?b)
                                                           (expression-functor ?t)))))
    (sme:rassert! (implies sanctioned-pairing (MH ?b ?t) (0.4 . 0.0))))

(rule ((:intern (MH ?b ?t) :test (and (entity? ?b) (entity? ?t)
                                      (sanctioned-pairing? (entity-name ?b)
                                                           (entity-name ?t)))))
    (sme:rassert! (implies sanctioned-pairing (MH ?b ?t) (0.4 . 0.0))))

;;; Rule 18 (Functionally Analogous Evidence)
(initial-assertion (sme:assert! 'same-role))

(rule ((:intern (MH ?b ?t) :test (and (expression? ?b) (expression? ?t)))
       (:intern (function-of ?b ?f) :var ?f-of)
       (:intern (provides-function ?t ?f) :var ?p-f))
    (sme:rassert! (implies (and ?f-of ?p-f) (MH ?b ?t) (0.8 . 0.0))))

;;; Rules 19-20 (Systematicity)

;;; Non-commutative case

(rule ((:intern (MH ?b1 ?t1) :test (and (expression? ?b1) (expression? ?t1)
                                        (not (commutative? (expression-functor ?b1)))))
       (:intern (MH ?b2 ?t2) :test (children-of? ?b2 ?t2 ?b1 ?t1)))
    (sme:rassert! (implies (MH ?b1 ?t1) (MH ?b2 ?t2) (0.8 . 0.0))))

;;; Commutative case

(rule ((:intern (MH ?b1 ?t1) :test (and (expression? ?b1) (expression? ?t1)
                                        (commutative? (expression-functor ?b1))))
       (:intern (MH ?b2 ?t2) :test (and (member ?b2 (expression-arguments ?b1) :test #'eq)
                                        (member ?t2 (expression-arguments ?t1) :test #'eq))))
    (sme:rassert! (implies (MH ?b1 ?t1) (MH ?b2 ?t2) (0.8 . 0.0))))

;;; Rule 21 (Behavioral)
(initial-assertion (sme:assert! 'behavioral))

(rule ((:intern (MH ?b ?t) :test (and (expression? ?b) (expression? ?t)
                                      (behavioral-relation? ?b) (behavioral-relation? ?t))))
    (sme:rassert! (implies behavioral (MH ?b ?t) (0.4 . 0.0))))

;;; Rule 22 (Provides Relevant Inference)

(rule ((:intern (CI ?gmap (B-Explains ?theory ?behavior))
                :test (Current-observation ?behavior)))
    (setf (getf (gm-plist ?gmap) :relevant?) T))   ;handle evidence in gmap rule

One final rule is needed to sum all the evidence for each gmap. For efficiency, the sum
is computed directly rather than going through the BMS justification mechanism.
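
The direct summation can be sketched in Python as follows. The function name and argument layout are assumptions made for illustration: the positive belief of each match hypothesis is taken as already computed by the BMS, and the 0.9 increment corresponds to the bonus the final rule adds when a gmap has been flagged as providing a relevant inference.

```python
def gmap_score(match_beliefs, relevant, relevance_bonus=0.9):
    """Sum the positive belief of every match hypothesis in a gmap.

    match_beliefs: positive-belief values of the gmap's match hypotheses,
                   assumed to have been computed already by the BMS
    relevant:      True if the gmap was flagged as providing a relevant inference
    """
    score = 0.0
    for belief in match_beliefs:
        score += belief
    if relevant:
        score += relevance_bonus  # extra evidence for a relevant inference
    return score


print(gmap_score([0.4, 0.8], relevant=True))
```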

(rule ((:intern (GMAP ?gm)))
    (setf (node-belief+ (gm-bms-node ?gm)) 0)
    (dolist (mh (gm-elements ?gm))
      (incf (node-belief+ (gm-bms-node ?gm)) (node-belief+ (mh-bms-node mh))))
    (if (getf (gm-plist ?gm) :relevant?)
        (incf (node-belief+ (gm-bms-node ?gm)) 0.9)))

Appendix C

Phineas Initial Domain Knowledge



PHINEAS begins each explanation task with a collection of qualitative theories about physical 
processes, entities, and general physical rules. Except where indicated in Chapter 9, the 
initial set of domain knowledge was the same for all examples described in this thesis and 
represents the contents of this section. A few special cases required additions or deletions 
to the basic set. For example, all knowledge concerning heat phenomena was removed for 
the caloric heat flow task. 

All domain knowledge is managed by PHINEAS' ATMS problem solver. This consists of a 
pattern-directed rule engine and the ATMS itself. Because QPE requires access to the same 
domain knowledge that PHINEAS possesses, all rule and process declarations are switch-
able, in that they may expand into either a PHINEAS declaration or a QPE declaration.
This is indicated by the asterisk in the form name* and is controlled by a global flag,
*qp-toggle-switch*. In some cases, explicit references to packages mem and adb are used,
which refer to PHINEAS' problem solver and QPE's problem solver, respectively. Package
translations are sometimes made depending on the value of *qp-toggle-switch*.

C.1 Rules

PHINEAS rules appear in two forms. The rule* construct indicates a forward-chaining rule
which always runs whenever possible. These are used strictly for declaring inconsistencies
(e.g., (gas ?x) is inconsistent with (liquid ?x)). The (<==* consequent antecedents)
form expands into two rules. One is an exhaustive, forward chaining rule (i.e., rule*). The
other is stored for backward chaining purposes by PHINEAS' abductive retriever.
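
Many of the inconsistency rules below use an at-most-one declaration. A minimal Python sketch of one way such a declaration could be read is given here, assuming (this is not confirmed by the listing) that it is equivalent to a set of pairwise nogoods. The function names and the string encoding of statements are hypothetical.

```python
from itertools import combinations


def at_most_one(statements):
    """Expand an at-most-one declaration into pairwise nogoods."""
    return [frozenset(pair) for pair in combinations(statements, 2)]


def violates(beliefs, nogoods):
    """True if the belief set contains every member of some nogood."""
    return any(nogood <= beliefs for nogood in nogoods)


phase_nogoods = at_most_one(["(solid cup)", "(liquid cup)", "(gas cup)"])
print(len(phase_nogoods))
print(violates({"(gas cup)", "(liquid cup)"}, phase_nogoods))
print(violates({"(solid cup)"}, phase_nogoods))
```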

;;; Closed World Assumptions

;;; Assume all spatial relationships are subject to closed world assumption
(mem:defClosable Touching)
(mem:defClosable Physical-Path)
(mem:defClosable Immersed-in)
(mem:defClosable Spring)
(mem:defClosable Block)
(mem:defClosable Beaker)

;;; Phase

;;; Must be in only one state at a time

(rule* :intern ((solid ?obj)) (mem:at-most-one `((solid ,?obj) (liquid ,?obj) (gas ,?obj))))
(rule* :intern ((liquid ?obj)) (mem:at-most-one `((solid ,?obj) (liquid ,?obj) (gas ,?obj))))
(rule* :intern ((gas ?obj)) (mem:at-most-one `((solid ,?obj) (liquid ,?obj) (gas ,?obj))))

(<==* (not (Fluid ?fl)) ((Solid ?fl)))
(<==* (not (Solid ?fl)) ((Fluid ?fl)))
(<==* (Fluid ?fl) ((Gas ?fl)))
(<==* (Fluid ?fl) ((Liquid ?fl)))

;;; A QPE specific rule

(adb:rule :intern ((((process-instance ?name) ?inst) . :TRUE))
    (adb:rassert! ((process-instance-of ?name ?inst) . :TRUE)))

;;; Miscellaneous nogoods...

(rule* :intern ((container-of ?obj ?obj))      ;something can't contain itself
    (mem:rassert! ((container-of ?obj ?obj) . :FALSE)))

;;; The inequality taxonomy
(mem:rule :intern (((greater-than ?n1 ?n2) . ?cond))
    (mem:taxonomy (list (list 'greater-than ?n1 ?n2)
                        (list 'equal-to ?n1 ?n2)
                        (list 'less-than ?n1 ?n2))))
(mem:rule :intern (((equal-to ?n1 ?n2) . ?cond))
    (mem:referent `((greater-than ,?n1 ,?n2) . ?cond)))
(mem:rule :intern (((less-than ?n1 ?n2) . ?cond))
    (mem:referent `((greater-than ,?n1 ,?n2) . ?cond)))

(mem:rule :intern (((greater-than ?n1 ?n2) . ?cond))
    (mem:rjustify ((less-than ?n2 ?n1) . ?cond)
                  (((greater-than ?n1 ?n2) . ?cond))))

(mem:rule :intern (((greater-than ?n1 ?n2) . :TRUE))
    (mem:rjustify ((greater-than ?n2 ?n1) . :FALSE)
                  (((greater-than ?n1 ?n2) . :TRUE))))

;;; Containment...

(<==* (not (Can-absorb ?solid ?fl))
      ((Fluid ?fl) (Solid ?solid) (not (Porous ?solid))))

(<==* (not (Can-contain ?container ?fl))
      ((Fluid ?fl) (Volume-Solid ?container)))

(rule* :intern (((can-contain ?n1 ?n1) . ?cond))   ;something can't contain itself
    (mem:rassert! ((can-contain ?n1 ?n1) . :FALSE)))

;;; Spatial relationships...

(<==* (Touching ?obj1 ?obj2)
      ((Immersed-in ?obj1 ?obj2)))

(<==* (Touching ?cl atmosphere)
      ((Contained-Liquid ?cl)
       (Container-of ?cl ?container)
       (Open ?container)))

(<==* (Greater-than (A (contact-surface ?obj1 ?obj2)) zero)
      ((Touching ?obj1 ?obj2)))

(<==* (quantity (contact-surface ?obj1 ?obj2))
      ((Touching ?obj1 ?obj2)))

;;; We need to do the next one twice because QPE needs to go through its macro expander
(<==* (Qprop (contact-surface ?obj1 ?obj2) (amount-of ?obj1))
      ((Immersed-in ?obj1 ?obj2)))

(adb:rule :intern ((immersed-in ?obj1 ?obj2))
    (install-runtime-qprop `(contact-surface ,?obj1 ,?obj2)
                           `(Qprop (contact-surface ,?obj1 ,?obj2) (amount-of ,?obj1))
                           `(((immersed-in ,?obj1 ,?obj2) . :TRUE))))

;;; Paths...

(rule* :intern (((Physical-Path ?obj1 ?obj2 (Common-Face ?objA ?objB)) . :TRUE))
    (unless (and (or (equal ?obj1 ?objA) (equal ?obj1 ?objB))
                 (or (equal ?obj2 ?objA) (equal ?obj2 ?objB))
                 (not (equal ?objA ?objB)))
      (mem:rassert! ((Physical-Path ?obj1 ?obj2 (Common-Face ?objA ?objB)) . :FALSE))))

(rule* :intern (((Physical-Path ?obj ?obj ?path) . :TRUE))
    (mem:rassert! ((Physical-Path ?obj ?obj ?path) . :FALSE)))

(rule* :intern (((Container-of ?obj ?can1) . :TRUE)
                ((Container-of ?obj ?can2) . :TRUE) :test (not (equal ?can1 ?can2)))
    (mem:at-most-one `((Container-of ,?obj ,?can1) (Container-of ,?obj ,?can2))))

(<==* (Physical-Path ?obj1 ?obj2 (Common-Face ?obj1 ?obj2))
      ((Touching ?obj1 ?obj2)))

(<==* (Touching ?obj2 ?obj1) ((Touching ?obj1 ?obj2)))

(rule* :intern (((Touching ?obj1 ?path) . :TRUE)
                ((Touching ?path ?obj2) . :TRUE) :test (not (equal ?obj1 ?obj2)))
    (mem:rjustify ((Physical-Path ?obj1 ?obj2 ?path) . :TRUE)
                  (((Touching ?obj1 ?path) . :TRUE)
                   ((Touching ?path ?obj2) . :TRUE))))

(rule* :intern (((Solid ?obj) . :TRUE))
    (mem:rassume! ((Volume-Solid ?obj) . :TRUE) :DEFAULT-OF-SOLID))

(<==* (not (Fluid-Path ?obj)) ((Volume-Solid ?obj)))
(<==* (not (Fluid-Path ?obj)) ((Gas ?obj)))
(<==* (not (Fluid-Path ?obj)) ((Liquid ?obj)))

(<==* (Fluid-Path (Common-Face ?obj2 ?obj1)) ((Fluid-Path (Common-Face ?obj1 ?obj2))))

(<==* (Aligned (Common-Face ?obj1 ?obj2)) ((Touching ?obj1 ?obj2)))

(<==* (Supports-Flow (Common-Face ?obj1 ?obj2)) ((Touching ?obj1 ?obj2)))

(<==* (Flowing-Fluid ?fluid) ((Flowing-Air ?fluid)))

(<==* (Equal-to (A (Amount-of-at ?stuff ?part)) zero)
      ((Quantity (Amount-of-at ?stuff ?part))
       (not (Touching ?stuff ?part))))

;;; Miscellaneous object properties...

(<==* (heat-path ?obj) ((beaker ?obj)))

(<==* (can-contain ?obj alcohol) ((beaker ?obj)))

;;; Linear and angular spatial quantities...

(rule* :intern ((linear-q ?q))
  (sme:at-most-one `((linear-q ,?q) (angular-q ,?q))))
(rule* :intern ((angular-q ?q))
  (sme:at-most-one `((linear-q ,?q) (angular-q ,?q))))

(<==* (Linear-q ?dq) ((Derivative-of (?q ?obj) (?dq ?obj))
                      (Linear-q ?q)))
(<==* (Linear-q ?q) ((Derivative-of (?q ?obj) (?dq ?obj))
                     (Linear-q ?dq)))
(<==* (Angular-q ?dq) ((Derivative-of (?q ?obj) (?dq ?obj))
                       (Angular-q ?q)))
(<==* (Angular-q ?q) ((Derivative-of (?q ?obj) (?dq ?obj))
                      (Angular-q ?dq)))

(<==* (Linear-q ?acc) ((Force-Application (?force ?src) (?acc ?dst))
                       (Linear-q ?force)))
(<==* (Angular-q ?acc) ((Force-Application (?force ?src) (?acc ?dst))
                        (Angular-q ?force)))

;;; The styrofoam cup example

(<==* (Open-Concavity ?obj) ((Open-Conical ?obj)))
(<==* (Open-Concavity ?obj) ((Open-Cylinder ?obj)))
(<==* (stable ?obj) ((Flat-Bottom ?obj)))

(<==* (Insulates-Heat ?obj) ((Has-Handle ?obj)))
(<==* (Insulates-Heat ?obj) ((Styrofoam ?obj)))

(rule* :intern (((Substance ?sub1) . :TRUE)
                ((?sub1 ?obj) . :TRUE)
                ((Substance ?sub2) . :TRUE) :test (not (eq ?sub1 ?sub2)))
  (sme:rjustify ((?sub2 ?obj) . :FALSE)
                (((?sub1 ?obj) . :TRUE))))

(assert* '(Substance metal))
(assert* '(Substance styrofoam))

(rule* :intern ((open-conical ?obj))
  (sme:at-most-one `((open-conical ,?obj) (open-cylinder ,?obj))))
(rule* :intern ((open-cylinder ?obj))
  (sme:at-most-one `((open-conical ,?obj) (open-cylinder ,?obj))))

(defView* (Styrofoam-cup-plan ?cup)
  Individuals ((?cup :conditions
                     (Open-conical ?cup)
                     (Flat-bottom ?cup)
                     (Styrofoam ?cup)
                     (Weight ?cup six)
                     (non-porous ?cup)))
  Relations ((Hot-Cup ?cup))
  Cache ((Supports (Open-Conical ?cup)
                   (prereq (Can-Pour ?cup) (Open-Concavity ?cup)))
         (Supports (Flat-Bottom ?cup)
                   (prereq (Can-contain ?cup ?liquid) (Stable ?cup)))
         (Supports (Styrofoam ?cup)
                   (prereq (Can-contain ?cup ?liquid) (Insulates-Heat ?cup)))))
;;; Defining quantity types for QPE...

(defQuantity-Type Amount-of Individual)
(defQuantity-Type Pressure Individual)
(defQuantity-Type Pressure-in Individual)
(defQuantity-Type Flow-Rate Individual)
(defQuantity-Type Heat-Flow-Rate Individual)

(defQuantity-Type Heat Individual)
(defQuantity-Type Temperature Individual)

(defQuantity-Type Change-Rate Individual)
(defQuantity-Type Vaporization-Rate Individual)
(defQuantity-Type Dissolve-Rate Individual)
(defQuantity-Type Surface-Area Individual)
(defQuantity-Type Contact-Surface Individual Individual)
(defQuantity-Type Concentration Individual)
(defQuantity-Type Saturation-Point Individual)
(defQuantity-Type Temperature-in Individual)
(defQuantity-Type Tboil Individual)
(defQuantity-Type Tfreeze Individual)

(defQuantity-Type Force Individual Individual)
(defQuantity-Type Internal-Force Individual)
(defQuantity-Type Torque Individual)
(defQuantity-Type Restorative-Force Individual)
(defQuantity-Type Length Individual)
(defQuantity-Type Rest-Length Individual)
(defQuantity-Type Displacement Individual)
(defQuantity-Type Angular-Displacement Individual)

(defQuantity-Type Position Individual)
(defQuantity-Type Angle Individual)
(defQuantity-Type Velocity Individual)
(defQuantity-Type Angular-Velocity Individual)
(defQuantity-Type Acceleration Individual)
(defQuantity-Type Angular-Acceleration Individual)

(defQuantity-Type Total-Energy Individual)
(defQuantity-Type Potential-Energy Individual)
(defQuantity-Type Kinetic-Energy Individual)

(defQuantity-Type Charge Individual)
(defQuantity-Type Current Individual)
(defQuantity-Type Voltage Individual)

(defQuantity-Type PnU-rootoro Individual)
(defQuantity-Type Curvature Individual)
(defQuantity-Type directional-Force Individual Individual)
(defQuantity-Type Amount-of-at Individual Individual)
(defQuantity-Type dir-to-center Individual)

(defQuantity-Type Gravity Individual)
(defQuantity-Type Mass Individual)
(defQuantity-Type Volume Individual)
(defQuantity-Type Density Individual)
(defQuantity-Type Buoyancy Individual)

(defInfluence I+)
(defInfluence I-)
(defInfluence Ctrans)

C.2 Entities

In QP theory, entities are defined using the form (defEntity header body). This defines a
predicate schema in which belief in the header form implies belief in the facts contained in
the body.
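
The schema reading of defEntity can be sketched outside of QPE. The Python below is only an illustration under stated assumptions: the nested-tuple encoding of facts, the helper names `match`, `substitute`, and `expand_entity`, and the individual `wheel1` are all hypothetical and are not part of QPE or PHINEAS. It shows how believing a header instance yields the instantiated body facts, using part of the Rotating-Object definition listed in this section.

```python
# Hypothetical sketch: facts are nested tuples, variables are strings starting with "?".

def substitute(term, bindings):
    """Replace ?variables in a nested-tuple term using the given bindings."""
    if isinstance(term, tuple):
        return tuple(substitute(t, bindings) for t in term)
    return bindings.get(term, term)

def match(pattern, fact, bindings=None):
    """One-way match of a pattern against a ground fact; returns bindings or None."""
    bindings = dict(bindings or {})
    if isinstance(pattern, str):
        if pattern.startswith("?"):
            if pattern in bindings and bindings[pattern] != fact:
                return None
            bindings[pattern] = fact
            return bindings
        return bindings if pattern == fact else None
    if not isinstance(fact, tuple) or len(pattern) != len(fact):
        return None
    for p, f in zip(pattern, fact):
        bindings = match(p, f, bindings)
        if bindings is None:
            return None
    return bindings

def expand_entity(header, body, believed_fact):
    """If believed_fact is an instance of header, return the implied body facts."""
    bindings = match(header, believed_fact)
    if bindings is None:
        return []
    return [substitute(f, bindings) for f in body]

# Part of the Rotating-Object entity from this appendix, transcribed as tuples.
ROTATING_HEADER = ("Rotating-Object", "?obj")
ROTATING_BODY = [
    ("Quantity", ("Angle", "?obj")),
    ("Quantity", ("Angular-Velocity", "?obj")),
    ("Derivative-of", ("Angle", "?obj"), ("Angular-Velocity", "?obj")),
]

implied = expand_entity(ROTATING_HEADER, ROTATING_BODY, ("Rotating-Object", "wheel1"))
for fact in implied:
    print(fact)
```

A fact that does not match the header, such as `("Spring", "s1")`, produces no implied facts in this sketch.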

(assert* '((contained-gas atmosphere) . :TRUE))

(assert* '((liquid alcohol) . :TRUE))

(assert* '((substance water) . :TRUE))
(assert* '((substance alcohol) . :TRUE))
(assert* '((substance milk) . :TRUE))
(assert* '((substance laol) . :TRUE))
(assert* '((substance air) . :TRUE))

(assert* '((Quantity (Gravity earth)) . :TRUE))

(defEntity* (Rotating-Object ?obj)
  (Quantity (Angle ?obj))
  (Quantity (Angular-Velocity ?obj))
  (Derivative-of (Angle ?obj) (Angular-Velocity ?obj))
  (Quantity (Kinetic-Energy ?obj))
  (not (less-than (A (Kinetic-Energy ?obj)) zero))
  (Q= (Kinetic-Energy ?obj) (* (Angular-Velocity ?obj) (Angular-Velocity ?obj))))

(defEntity* (Translating-Object ?obj)
  (Physob ?obj)
  (Quantity (Position ?obj))
  (Quantity (Velocity ?obj))
  (Derivative-of (Position ?obj) (Velocity ?obj))
  (Quantity (Kinetic-Energy ?obj))
  (not (less-than (A (Kinetic-Energy ?obj)) zero))
  (Q= (Kinetic-Energy ?obj) (* (Velocity ?obj) (Velocity ?obj))))

(defEntity* (Parted-Object ?obj ?pos-part ?neg-part)
  (Object-Part-of ?pos-part ?obj)
  (Object-Part-of ?neg-part ?obj)
  (Opposite-sides ?pos-part ?neg-part)
  (Quantity (Dir-to-center ?pos-part))
  (Quantity (Dir-to-center ?neg-part))
  (Greater-than (A (Dir-to-center ?neg-part)) zero)
  (Less-than (A (Dir-to-center ?pos-part)) zero))

(defEntity* Thermal-Object
  (Physob ?self)
  (Quantity (Temperature ?self))
  (Quantity (Heat ?self))
  (Quantity (Tboil ?self))
  (Quantity (Tfreeze ?self))
  (Greater-Than (A (Tboil ?self)) (A (Tfreeze ?self)))
  (Greater-Than (A (Tfreeze ?self)) zero)
  (not (Less-Than (A (Heat ?self)) zero))
  (not (Less-Than (A (Temperature ?self)) zero))   ;Kelvin
  (Qprop (Temperature ?self) (Heat ?self)))

(<==* (greater-than (A (Amount-of ?obj)) zero)
   ((Quantity (Heat ?obj))
    (Quantity (Amount-of ?obj))
    (Greater-than (A (Heat ?obj)) zero)))

(<==* (Greater-than (A (Heat ?obj)) zero) ((Stove ?obj)))
(<==* (Thermal-object ?obj) ((Stove ?obj)))

(defEntity* Physob
  (Quantity (Amount-of ?self))
  (Quantity (change-rate (Amount-of ?self)))
  (Q= (Change-Rate (Amount-of ?self)) (Net-Influence (Amount-of ?self)))
  (Quantity (Surface-area ?self))
  (not (less-than (A (Amount-of ?self)) zero))
  (not (less-than (A (Surface-area ?self)) zero))
  (Qprop (surface-area ?self) (Amount-of ?self))
  (Thermal-Object ?self))

(defEntity* Contained-Liquid
  (Physob ?self)
  (Liquid ?self)
  (State-of ?self liquid))

(<==* (Contained-Fluid ?cl ?sub ?container)
   ((Contained-Liquid ?cl)
    (Container-of ?cl ?container)
    (Substance-of ?cl ?sub)))

(defEntity* Contained-Gas
  (Physob ?self)
  (Gas ?self)
  (State-of ?self gas))

(<==* (Contained-Fluid ?cg ?sub ?container)
   ((Contained-Gas ?cg)
    (Container-of ?cg ?container)
    (Substance-of ?cg ?sub)))

(defEntity* (Contained-Fluid ?cs ?sub ?container)
  (Physob ?cs)
  (Container-of ?cs ?container)
  (Substance-of ?cs ?sub)
  (Quantity (Amount-of ?cs))
  (Quantity (Pressure-in ?container))
  (Qprop (Pressure-in ?container) (Amount-of ?cs)))

(adb:rule :intern ((contained-fluid ?cs ?sub ?container)
                   (liquid ?cs))
  (install-runtime-qprop `(Pressure-in ,?container)
                         `(Qprop (Pressure-in ,?container) (amount-of ,?cs))
                         `(((contained-fluid ,?cs ,?sub ,?container) . :TRUE)
                           ((liquid ,?cs) . :TRUE))))

(defEntity* (Solution ?solution)
  (Quantity (Concentration ?solution))
  (Quantity (Saturation-Point ?solution))
  (not (less-than (A (Concentration ?solution)) zero))
  (not (less-than (A (Saturation-Point ?solution)) zero)))

(defEntity* (Gas-Mixture ?solution)
  (Quantity (Concentration ?solution))
  (Quantity (Saturation-Point ?solution))
  (not (less-than (A (Concentration ?solution)) zero))
  (not (less-than (A (Saturation-Point ?solution)) zero)))

(<==* (Mixture ?sol) ((Solution ?sol)))
(<==* (Mixture ?gas) ((Gas-Mixture ?gas)))

(<==* (container-of ?component ?container)
   ((mixture ?mixture)
    (primary-component-of ?mixture ?component)
    (container-of ?mixture ?container)))
(<==* (container-of ?component ?container)
   ((mixture ?mixture)
    (secondary-component-of ?mixture ?component)
    (container-of ?mixture ?container)))

(defEntity* (Spring ?spring)
  (Quantity (Displacement ?spring))
  (Quantity (Restorative-Force ?spring))
  (Qprop- (Restorative-Force ?spring) (Displacement ?spring))
  (Correspondence ((A (Restorative-force ?spring)) ZERO)
                  ((A (Displacement ?spring)) zero))
  (Quantity (Potential-energy ?spring))
  (not (Less-than (A (Potential-Energy ?spring)) zero))
  (Q= (Potential-Energy ?spring) (* (Displacement ?spring) (Displacement ?spring))))

(defEntity* (Spring-Mass-System ?system ?spring ?mass)
  (Connected ?mass ?spring)
  (Quantity (Total-Energy ?system))
  (not (less-than (A (Total-Energy ?system)) zero))
  (Equal-to (D (Total-Energy ?system)) zero)
  (Q= (Total-Energy ?system) (+ (Kinetic-Energy ?mass) (Potential-Energy ?spring)))
  (Q= (Displacement ?spring) (Position ?mass))
  (Force-Application (Restorative-Force ?spring) (Velocity ?mass)))

(defEntity* (air-plane-wing ?wing)
  (Physob ?wing)
  (Translating-Object ?wing)
  (Quantity (Curvature (Bside ?wing)))
  (Quantity (Curvature (Tside ?wing)))
  (Greater-than (A (Curvature (Bside ?wing))) zero)
  (Parted-Object ?wing (Tside ?wing) (Bside ?wing))
  (Greater-than (A (Curvature (Tside ?wing))) (A (Curvature (Bside ?wing)))))

(defEntity* (ball ?ball)
  (Physob ?ball)
  (Quantity (Curvature (Bside ?ball)))
  (Quantity (Curvature (Tside ?ball)))
  (Greater-than (A (Curvature (Bside ?ball))) zero)
  (Parted-Object ?ball (Tside ?ball) (Bside ?ball))
  (Equal-to (A (Curvature (Bside ?ball))) (A (Curvature (Tside ?ball)))))

(defEntity* (Dense-Object ?obj)
  (Quantity (Density ?obj))
  (Quantity (Mass ?obj))
  (Quantity (Volume ?obj))
  (greater-than (A (Density ?obj)) zero)
  (greater-than (A (Volume ?obj)) zero)
  (greater-than (A (Mass ?obj)) zero)
  (Q= (Mass ?obj) (* (Volume ?obj) (Density ?obj))))



C.3 Processes 

Processes are defined using the form 

(defProcess (<Name> <Individuals>)
   Individuals ((<Individual> :conditions <Conditions>)
                ...)
   Preconditions <Facts>
   QuantityConditions <Inequalities>
   Relations <Facts>
   Influences <Influences>)

The individuals specify what objects would be involved in the process if it were active, 
the preconditions and quantity conditions indicate when the process will be active, and 
the relations and influences specify what relations will hold while the process is active. A 
similar construct, defView, is used to express conditioned relations that do not include 
influences. 
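
The activation rule just described can be illustrated with a small sketch. This is not QPE: the `ProcessInstance` class, the helper names, and the use of plain numbers in place of QP theory's qualitative ordinal values are assumptions made only for illustration. The example instance mirrors the Liquid-Flow definition in this section, bound to the individuals of the two-container observation in Appendix D.

```python
from dataclasses import dataclass

@dataclass
class ProcessInstance:
    """Hypothetical ground instance of a process definition."""
    name: str
    individual_conditions: list   # ground facts required for the instance to exist
    preconditions: list           # ground facts required for activity
    quantity_conditions: list     # (relation, lhs quantity, rhs quantity or "zero")

COMPARE = {
    "greater-than": lambda a, b: a > b,
    "less-than": lambda a, b: a < b,
    "equal-to": lambda a, b: a == b,
}

def amount(quantity, values):
    """Look up a quantity's amount; the symbol zero is a fixed reference point."""
    return 0.0 if quantity == "zero" else values[quantity]

def is_active(proc, facts, values):
    """Active when individuals exist, preconditions hold, and inequalities are met."""
    if not all(f in facts for f in proc.individual_conditions):
        return False
    if not all(f in facts for f in proc.preconditions):
        return False
    return all(COMPARE[rel](amount(lhs, values), amount(rhs, values))
               for rel, lhs, rhs in proc.quantity_conditions)

flow = ProcessInstance(
    name="liquid-flow",
    individual_conditions=[("Fluid-Path", "pipe1"),
                           ("Physical-Path", "beaker3", "vial2", "pipe1")],
    preconditions=[("fluid-aligned", "pipe1")],
    quantity_conditions=[
        ("greater-than", ("Pressure-in", "beaker3"), ("Pressure-in", "vial2")),
        ("greater-than", ("Amount-of", "cs-water-beaker"), "zero"),
    ],
)

facts = {("Fluid-Path", "pipe1"),
         ("Physical-Path", "beaker3", "vial2", "pipe1"),
         ("fluid-aligned", "pipe1")}
values = {("Pressure-in", "beaker3"): 2.0,
          ("Pressure-in", "vial2"): 1.0,
          ("Amount-of", "cs-water-beaker"): 5.0}

print(is_active(flow, facts, values))
```

Once the two pressures are equal, the first quantity condition fails and the instance is no longer active, which matches the final situation recorded for the two-container behavior.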

(defProcess* (Liquid-Flow ?subst ?source ?src-cs ?destination ?dst-cs ?path)
  Individuals ((?subst :conditions (substance ?subst)   ;some stuff
                                   (Liquid ?subst))     ;state
               (?source :conditions (Can-Contain ?source ?subst))
               (?src-cs :conditions (Contained-Fluid ?src-cs ?subst ?source))
               (?destination :conditions (Can-Contain ?destination ?subst))
               (?dst-cs :conditions (Contained-Fluid ?dst-cs ?subst ?destination))
               (?path :conditions (Fluid-Path ?path)
                                  (Physical-Path ?source ?destination ?path)))
  Preconditions ((fluid-aligned ?path))
  QuantityConditions ((greater-than (A (Pressure-in ?source))
                                    (A (Pressure-in ?destination)))
                      (greater-than (A (Amount-of ?src-cs)) zero))
  Relations ((Quantity (flow-rate ?self))
             (Q= (flow-rate ?self) (- (pressure-in ?source) (pressure-in ?destination)))
             (Greater-than (A (flow-rate ?self)) zero))
  Influences ((Ctrans (amount-of ?src-cs) (amount-of ?dst-cs) (A (flow-rate ?self)))))

(defProcess* (Liquid-Drain ?sink ?sink-cs ?lf)   ;an ideal sink
  Individuals ((?sink :conditions (Liquid-Sink ?sink))
               (?sink-cs :conditions (contained-liquid ?sink-cs)
                                     (container-of ?sink-cs ?sink))
               (?lf :conditions (Process-Instance-of Liquid-Flow ?lf)
                                (?lf destination ?sink)))
  QuantityConditions ((Active ?lf))
  Influences ((I- (amount-of ?sink-cs) (A (flow-rate ?lf)))))

(defProcess* (Heat-Flow ?source ?destination ?path)
  Individuals ((?source :conditions (Thermal-Object ?source))
               (?destination :conditions (Thermal-Object ?destination))
               (?path :conditions (Heat-Path ?path)
                                  (Heat-Connection ?path ?source ?destination)))
  Preconditions ((heat-aligned ?path))
  QuantityConditions ((greater-than (A (temperature ?source)) (A (temperature ?destination)))
                      (greater-than (A (amount-of ?source)) zero)
                      (greater-than (A (amount-of ?destination)) zero))
  Relations ((Quantity (heat-flow-rate ?self))
             (Q= (heat-flow-rate ?self) (- (temperature ?source)
                                           (temperature ?destination))))
  Influences ((Ctrans (heat ?source) (heat ?destination) (A (heat-flow-rate ?self)))))

(defProcess* (Boiling ?subst ?container ?cl ?cg ?hf)
  Individuals ((?subst :conditions (substance ?subst))
               (?container :conditions (can-contain ?container ?subst))
               (?cl :conditions (contained-liquid ?cl)
                                (container-of ?cl ?container)
                                (substance-of ?cl ?subst))
               (?cg :conditions (contained-gas ?cg)
                                (container-of ?cg ?container)
                                (substance-of ?cg ?subst))
               (?hf :conditions (Process-Instance-of Heat-Flow ?hf)
                                (?hf destination ?cl)))
  QuantityConditions ((Active ?hf)
                      (not (greater-than (A (tboil ?cl)) (A (temperature ?cl))))
                      (greater-than (A (amount-of ?cl)) zero))
  Relations ((Quantity (Vaporization-Rate ?self))
             (Q= (Vaporization-Rate ?self) (heat-flow-rate ?hf))
             (greater-than (A (vaporization-rate ?self)) zero))
  Influences ((I- (heat ?cl) (A (vaporization-rate ?self)))
              (Ctrans (amount-of ?cl) (amount-of ?cg) (A (vaporization-rate ?self))))
  Duration (1800 14400))

(defProcess* (Heat-Replenish ?source ?hf)   ;an ideal source
  Individuals ((?source :conditions (Heat-Source ?source))
               (?hf :conditions (Process-Instance-of Heat-Flow ?hf)
                                (?hf source ?source)))
  QuantityConditions ((Active ?hf))
  Relations ((Equal-to (D (heat ?source)) ZERO))
  Influences ((I+ (heat ?source) (A (heat-flow-rate ?hf)))))

(defProcess* (Dissolve ?solute ?solution)
  Individuals ((?solute :conditions (Solid ?solute)
                                    (Soluble ?solute))
               (?solution :conditions (Solution ?solution)
                                      (immersed-in ?solute ?solution)))
  Preconditions ((Soluble-in ?solute ?solution))
  QuantityConditions ((Greater-than (A (Amount-of ?solute)) zero))
  Relations ((Quantity (Dissolve-rate ?self))
             (qprop (dissolve-rate ?self) (contact-surface ?solute ?solution))
             (greater-than (A (dissolve-rate ?self)) zero))
  Influences ((Ctrans (amount-of ?solute) (concentration ?solution)
                      (A (dissolve-rate ?self))))
  Cache ((Supports (immersed-in ?solute ?solution)
                   (prereq (Ctrans (amount-of ?solute) (concentration ?solution)
                                   (A (dissolve-rate ?self)))
                           (Physical-Path ?solute ?solution ?path)
                           (Continuity-Law)))))

(defView* (Buoyancy ?obj ?liquid)
  Individuals ((?obj :conditions (Translating-Object ?obj)
                                 (Dense-Object ?obj))
               (?liquid :conditions (Liquid ?liquid)
                                    (Dense-Object ?liquid)
                                    (Immersed-in ?obj ?liquid)))
  Relations ((Less-than (A (Gravity earth)) zero)
             (Q= (Gravity earth) (- (Mass ?obj)))
             (Force-application (Gravity earth) (Velocity ?obj))
             (Dense-Object (Portion-of ?liquid))
             (Quantity (Buoyancy ?liquid))
             (Q= (Buoyancy ?liquid) (Mass (Portion-of ?liquid)))
             (Force-application (Buoyancy ?liquid) (Velocity ?obj))
             (Correspondence ((A (Gravity earth)) (A (Buoyancy ?liquid)))
                             ((A (Mass ?obj)) (A (Mass (Portion-of ?liquid)))))))

(defView* (Fully-Submerged-Object ?obj ?liquid)
  Individuals ((?obj :conditions (Translating-Object ?obj)
                                 (Dense-Object ?obj))
               (?liquid :conditions (Liquid ?liquid)
                                    (Dense-Object ?liquid)
                                    (Immersed-in ?obj ?liquid)))
  Relations ((Q= (Volume (Portion-of ?liquid)) (Volume ?obj))
             (Less-than (A (Position ?obj)) zero)
             (Greater-than (A (Density ?obj)) (A (Density ?liquid)))
             (Qprop (Density (Portion-of ?liquid)) (Density ?liquid))
             (Qprop- (Density (Portion-of ?liquid)) (Position ?obj))
             ;;; Here, we'll take the "density" of the liquid to be at S.T.P.
             (Correspondence ((A (Density (Portion-of ?liquid))) (A (Density ?liquid)))
                             ((A (Position ?obj)) zero))))

(defView* (Floating-Object ?obj ?liquid)
  Individuals ((?obj :conditions (Translating-Object ?obj)
                                 (Dense-Object ?obj))
               (?liquid :conditions (Liquid ?liquid)
                                    (Dense-Object ?liquid)
                                    (Floating-on ?obj ?liquid)))
  Relations ((Qprop- (Volume (Portion-of ?liquid)) (Position ?obj))
             (Correspondence ((A (Volume (Portion-of ?liquid))) zero)
                             ((A (Position ?obj)) zero))
             (not (Greater-than (A (Position ?obj)) zero))
             (not (Greater-than (A (Density ?obj)) (A (Density ?liquid))))
             (Q= (Density (Portion-of ?liquid)) (Density ?liquid))))

(defView* (Air-Envelope ?air ?obj ?pos-part ?neg-part)
  Individuals ((?air :conditions (Flowing-Air ?air))
               (?obj :conditions (Physob ?obj)
                                 (Enveloped ?obj ?air))
               (?pos-part :conditions (object-part-of ?pos-part ?obj))
               (?neg-part :conditions (object-part-of ?neg-part ?obj)
                                      (Opposite-sides ?pos-part ?neg-part)))
  Relations ((Touching ?pos-part ?air)
             (Touching ?neg-part ?air)
             (Quantity (Amount-of-at ?air ?pos-part))
             (Quantity (Amount-of-at ?air ?neg-part))
             ;attachment...
             (Qprop- (Amount-of-at ?air ?pos-part) (Position ?obj))
             (Qprop (Amount-of-at ?air ?neg-part) (Position ?obj))
             (Greater-than (A (Amount-of-at ?air ?pos-part)) zero)
             (Greater-than (A (Amount-of-at ?air ?neg-part)) zero)
             (Correspondence ((A (Amount-of-at ?air ?pos-part))
                              (A (Amount-of-at ?air ?neg-part)))
                             ((A (Position ?obj)) zero))))

(defView* (Fluid-Contact ?fluid ?obj ?part)
  Individuals ((?fluid :conditions (Flowing-Fluid ?fluid))
               (?obj :conditions (Physob ?obj))
               (?part :conditions (object-part-of ?part ?obj)
                                  (touching ?part ?fluid)))
  Relations ((Quantity (Amount-of-at ?fluid ?part))
             (Greater-than (A (Amount-of-at ?fluid ?part)) zero)))

(defView* (Air-Plane-Lift ?air ?obj ?part)
  Individuals ((?air :conditions (Flowing-Air ?air))
               (?obj :conditions (Air-Plane-Wing ?obj))
               (?part :conditions (object-part-of ?part ?obj)
                                  (touching ?part ?air)))
  Relations ((Quantity (directional-Force ?air ?part))
             (Less-than (A (directional-Force ?air ?part)) zero)
             (Qprop- (directional-Force ?air ?part) (Curvature ?part))
             (Qprop- (directional-Force ?air ?part) (Amount-of-at ?air ?part))
             (Correspondence ((A (directional-Force ?air ?part)) zero)
                             ((A (Curvature ?part)) zero))
             (Correspondence ((A (directional-Force ?air ?part)) zero)
                             ((A (Amount-of-at ?air ?part)) zero))))

(defView* (Positive-Space ?obj)
  Individuals ((?obj :conditions (Quantity (Position ?obj))))
  QuantityConditions ((Greater-than (A (position ?obj)) zero)))

(defView* (Negative-Space ?obj)
  Individuals ((?obj :conditions (Quantity (Position ?obj))))
  QuantityConditions ((less-than (A (position ?obj)) zero)))

(defView* (Fluid-push ?fluid ?obj ?part)
  Individuals ((?fluid :conditions (Flowing-Fluid ?fluid))
               (?obj :conditions (Physob ?obj))
               (?part :conditions (Object-part-of ?part ?obj)
                                  (Touching ?part ?fluid)
                                  (Pushed ?part ?fluid)))
  Relations ((Quantity (directional-Force ?fluid ?part))
             (Greater-than (A (directional-Force ?fluid ?part)) zero)
             (Qprop (directional-Force ?fluid ?part) (Amount-of-at ?fluid ?part))
             (Correspondence ((A (directional-Force ?fluid ?part)) zero)
                             ((A (Amount-of-at ?fluid ?part)) zero))))

;;; Osmosis - diffusion example

(defView* (Mixture-Combination ?mixture ?primary-component ?secondary-component)
  Individuals ((?mixture :conditions (Mixture ?mixture))
               (?primary-component :conditions
                 (Primary-component-of ?mixture ?primary-component))
               (?secondary-component :conditions
                 (Secondary-component-of ?mixture ?secondary-component)))
  Relations ((Quantity (Concentration ?mixture))
             (Quantity (Saturation-Point ?mixture))
             (Quantity (Amount-of ?mixture))
             (Quantity (Amount-of-in ?mixture ?primary-component))
             (Quantity (Amount-of-in ?mixture ?secondary-component))
             (not (less-than (A (Amount-of-in ?mixture ?primary-component)) zero))
             (not (less-than (A (Amount-of-in ?mixture ?secondary-component)) zero))
             (Q= (Amount-of ?mixture)
                 (+ (Amount-of-in ?mixture ?primary-component)
                    (Amount-of-in ?mixture ?secondary-component)))
             (Qprop (Concentration ?mixture) (Amount-of-in ?mixture ?primary-component))
             (Qprop- (Concentration ?mixture) (Amount-of-in ?mixture ?secondary-component))
             (Correspondence ((A (Concentration ?mixture)) ZERO)
                             ((A (Amount-of-in ?mixture ?secondary-component)) ZERO))))

(defProcess* (Osmosis ?subst ?source ?src-mixture ?src-cs
                      ?destination ?dst-mixture ?dst-cs ?path)
  Individuals ((?subst :conditions (substance ?subst)   ;some stuff
                                   (Liquid ?subst))     ;state
               (?source :conditions (Can-Contain ?source ?subst))
               (?src-mixture :conditions (Mixture ?src-mixture)
                                         (container-of ?src-mixture ?source))
               (?src-cs :conditions (Contained-Fluid ?src-cs ?subst ?source)
                                    (Primary-component-of ?src-mixture ?src-cs))
               (?destination :conditions (Can-Contain ?destination ?subst))
               (?dst-mixture :conditions (Mixture ?dst-mixture)
                                         (container-of ?dst-mixture ?destination))
               (?dst-cs :conditions (Contained-Fluid ?dst-cs ?subst ?destination)
                                    (Primary-component-of ?dst-mixture ?dst-cs))
               (?path :conditions (Solution-Path ?path)
                                  (Physical-Path ?source ?destination ?path)))
  Preconditions ((semi-permeable ?path))
  QuantityConditions ((greater-than (A (Concentration ?src-mixture))
                                    (A (Concentration ?dst-mixture)))
                      (greater-than (A (Amount-of ?src-cs)) zero))
  Relations ((Quantity (flow-rate ?self))
             (Q= (flow-rate ?self) (- (concentration ?src-mixture)
                                      (concentration ?dst-mixture)))
             (Greater-than (A (flow-rate ?self)) zero))
  Influences ((Ctrans (amount-of ?src-cs) (amount-of ?dst-cs) (A (flow-rate ?self)))))

(defProcess* (Applied-Force ?obj ?part ?src)
  Individuals ((?obj :conditions (Physob ?obj))
               (?part :conditions (object-part-of ?part ?obj))
               (?src :conditions (Quantity (directional-Force ?src ?part))))
  QuantityConditions ((not (Equal-to (A (Dir-to-center ?part)) ZERO)))
  Relations ((Quantity (force ?src ?part))
             (when (Greater-than (A (dir-to-center ?part)) zero)
               (Q= (force ?src ?part) (directional-Force ?src ?part)))
             (when (less-than (A (dir-to-center ?part)) zero)
               (Q= (force ?src ?part) (- (directional-Force ?src ?part)))))
  Influences ((I+ (Velocity ?obj) (A (force ?src ?part)))))

(defProcess* (Force-Process ?src ?dst)
  Individuals ((?src :conditions (Quantity (?force ?src)))
               (?dst :conditions (Quantity (?vel ?dst))
                                 (Force-Application (?force ?src) (?vel ?dst))))
  Influences ((I+ (?vel ?dst) (A (?force ?src)))))

;;; Generic process for handling the statement (derivative-of ?q ?qd)
(defProcess* (Derivative-Process ?amount ?derivative)
  Individuals ((?amount :conditions (Quantity ?amount))
               (?derivative :conditions (Quantity ?derivative)
                                        (Derivative-of ?amount ?derivative)))
  Influences ((I+ ?amount (A ?derivative))))

Appendix D
Phineas Experiences



PHINEAS begins each explanation task with a set of previously explained experiences, which
are indexed according to their behavioral abstractions. Its initial task is to retrieve
potentially relevant analogues from this set of stored experiences. This section lists the
complete set of experiences with which PHINEAS begins.
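
As a rough illustration of what indexing by behavioral abstraction could look like, the following Python sketch builds an inverted index from abstraction labels to stored behaviors and ranks candidates by the number of labels they share with a new observation. The ranking by shared-label count, the function names, and the probe observation are assumptions made for this sketch; they are not a description of PHINEAS's actual retrieval procedure. The characterizations themselves are taken from the behavior segments listed in Section D.2.

```python
from collections import defaultdict

# Characterizations of the four stored behaviors, as listed in Section D.2.
EXPERIENCES = {
    "2-container-lf": {"matter-movement", "continuous-movement",
                       "dual-approach-finish"},
    "liquid-draining-behavior": {"matter-movement", "continuous-movement",
                                 "monotonic"},
    "boiling-behavior": {"fluidic-phase-change-move", "continuous-movement",
                         "monotonic"},
    "dissolve-behavior": {"solid-phase-change-move", "continuous-movement",
                          "monotonic"},
}

def build_index(experiences):
    """Map each abstraction label to the set of experiences carrying it."""
    index = defaultdict(set)
    for name, labels in experiences.items():
        for label in labels:
            index[label].add(name)
    return index

def retrieve(index, observed_labels):
    """Rank stored experiences by how many abstraction labels they share."""
    scores = defaultdict(int)
    for label in observed_labels:
        for name in index.get(label, ()):
            scores[name] += 1
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))

index = build_index(EXPERIENCES)
# Hypothetical probe: a new behavior characterized like the two-container flow.
probe = {"matter-movement", "continuous-movement", "dual-approach-finish"}
ranking = retrieve(index, probe)
print(ranking)
```

With this probe the two-container liquid flow ranks first because it shares all three labels, while the phase-change behaviors share only continuous-movement.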



D.1 Behavioral Abstractions

(defBehavioral-Abstraction PHYSICAL-MOVEMENT
  SubTypes (matter-movement wave-movement phase-change-movement))

(defBehavioral-Abstraction matter-movement)
(defBehavioral-Abstraction wave-movement)

(defBehavioral-Abstraction phase-change-movement
  SubTypes (solid-phase-change-move fluidic-phase-change-move gas-phase-change-move))
(defBehavioral-Abstraction solid-phase-change-move)
(defBehavioral-Abstraction fluidic-phase-change-move)
(defBehavioral-Abstraction gas-phase-change-move)

;;; Observable, characterizing type physical movement

(defBehavioral-Abstraction PHYSICAL-MOVEMENT-CHARACTERISTICS
  SubTypes (corpuscular-movement continuous-movement wavelike))

(defBehavioral-Abstraction corpuscular-movement)
(defBehavioral-Abstraction wavelike)

(defBehavioral-Abstraction continuous-movement)

;;; Interaction types

(defBehavioral-Abstraction Contact-means
  SubTypes (fluid-flow solid-contact))
(defBehavioral-Abstraction fluid-flow)
(defBehavioral-Abstraction solid-contact)

;;;

(defBehavioral-Abstraction MODE-OF-FORCE
  SubTypes (action-at-a-distance direct-force))

(defBehavioral-Abstraction action-at-a-distance)

(defBehavioral-Abstraction direct-force
  SubTypes (push-force pull-force twisting-force))

(defBehavioral-Abstraction pull-force)
(defBehavioral-Abstraction twisting-force)
(defBehavioral-Abstraction push-force)

;;; Behavior types - graphical

(defBehavioral-Abstraction GRAPHICAL-CHARACTERIZERS
  SubTypes (monotonic cyclic))   ;non-descript)

(defBehavioral-Abstraction monotonic
  SubTypes (linear asymptotic-approach))

(defBehavioral-Abstraction linear)

(defBehavioral-Abstraction cyclic
  SubTypes (sinusoidal ramp-cyclic simple-cyclic))

(defBehavioral-Abstraction sinusoidal)
(defBehavioral-Abstraction ramp-cyclic)
(defBehavioral-Abstraction simple-cyclic)

(defBehavioral-Abstraction asymptotic-approach
  SubTypes (approach-constant dual-approach))

(defBehavioral-Abstraction approach-constant)

(defBehavioral-Abstraction dual-approach
  SubTypes (dual-approaching dual-approach-finish))

(defBehavioral-Abstraction dual-approaching)
(defBehavioral-Abstraction dual-approach-finish)

;;; Compile it

(compile-abstraction-hierarchy)

D.2 Past Observations

;;; Two container liquid flow

(sme:defEntity water)
(sme:defEntity liquid :constant? t)
(sme:defEntity beaker3)
(sme:defEntity vial2)
(sme:defEntity cs-water-beaker)
(sme:defEntity cs-water-vial)
(sme:defEntity pipe1)

(defObservation 2-container-lf
  Behavior 2-container-lf
  Individuals (beaker3 vial2 pipe1)
  World ((contained-liquid cs-water-beaker)
         (container-of cs-water-beaker beaker3)
         (substance-of cs-water-beaker water)
         (contained-liquid cs-water-vial)
         (container-of cs-water-vial vial2)
         (substance-of cs-water-vial water)
         (can-contain beaker3 water)
         (can-contain vial2 water)
         (substance water)
         (Liquid water)
         (Fluid-Path pipe1)
         (Physical-Path beaker3 vial2 pipe1)))

(defBSegment 2-container-lf
  Characterizations ((matter-movement ?self)
                     (continuous-movement ?self)
                     (dual-approach-finish ?self))
  Components (2-container-sit0 2-container-sit1)
  Relations ((meets 2-container-sit0 2-container-sit1)))

(defSituation 2-container-sit0
  Characterizations ((matter-movement ?self)
                     (continuous-movement ?self)
                     (dual-approaching ?self))
  Processes ((liquid-flow pi1 ((?subst . water) (?source . beaker3) (?src-cs . cs-water-beaker)
                               (?destination . vial2) (?dst-cs . cs-water-vial) (?path . pipe1)))
             (contained-fluid ((?cs . cs-water-beaker) (?sub . water)
                               (?container . beaker3)))
             (contained-fluid ((?cs . cs-water-vial) (?sub . water)
                               (?container . vial2))))
  Individuals (water cs-water-beaker cs-water-vial beaker3 vial2 pipe1 liquid)
  Dynamics ((Decreasing (Amount-of cs-water-beaker))
            (Decreasing (Pressure-in beaker3))
            (Increasing (Amount-of cs-water-vial))
            (Increasing (Pressure-in vial2))
            (Exponential-approach-Dual (Pressure-in beaker3) (Pressure-in vial2))
            (Greater-Than (A (Pressure-in beaker3)) (A (Pressure-in vial2)))))

(defSituation 2-container-sit1
  Individuals (cs-water-beaker cs-water-vial)
  Dynamics ((Constant (Amount-of cs-water-beaker))
            (Constant (Pressure-in beaker3))
            (Constant (Amount-of cs-water-vial))
            (Constant (Pressure-in vial2))
            (Equal-to (A (Pressure-in beaker3)) (A (Pressure-in vial2)))))
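
The dynamics recorded for the two situations above (source amount decreasing, destination amount increasing, pressures approaching each other and ending equal) are what one would expect from the Liquid-Flow relation flow-rate = pressure-in(source) - pressure-in(destination). The following Python sketch is a numerical stand-in for that qualitative behavior, not part of PHINEAS: the proportionality constant `k`, the step size, the starting amounts, and the function name are all assumed for illustration, and pressure is taken to be simply proportional to amount in place of the Qprop relation.

```python
def simulate_two_containers(amount_src, amount_dst, steps=200, dt=0.05, k=1.0):
    """Euler-step a flow whose rate is the pressure difference between containers.

    Assumption: pressure = k * amount, a numeric stand-in for
    (Qprop (Pressure-in ?container) (Amount-of ?cs)).
    """
    history = []
    for _ in range(steps):
        p_src, p_dst = k * amount_src, k * amount_dst
        flow_rate = p_src - p_dst
        # Quantity conditions of Liquid-Flow: source pressure must exceed
        # destination pressure and the source must still hold some liquid.
        if flow_rate <= 0 or amount_src <= 0:
            break
        amount_src -= flow_rate * dt   # Ctrans: amount leaves the source...
        amount_dst += flow_rate * dt   # ...and arrives at the destination
        history.append((amount_src, amount_dst))
    return history

history = simulate_two_containers(10.0, 2.0)
final_src, final_dst = history[-1]
print(round(final_src, 6), round(final_dst, 6))
```

In this sketch the difference between the two amounts shrinks by a constant factor each step, so both approach the common value from opposite sides, which is the numeric counterpart of the dual-approach characterization.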

;;; Liquid draining from a leaky cup

(sme:defEntity water)
(sme:defEntity leaky-cup)
(sme:defEntity cs-leaky)
(sme:defEntity sink5)
(sme:defEntity cs-sink5)
(sme:defEntity hole7)

(defObservation liquid-draining
  Behavior liquid-draining-behavior
  Individuals (leaky-cup cs-leaky water sink5 cs-sink5 hole7)
  World ((Liquid-Sink sink5)
         (container leaky-cup)
         (can-contain leaky-cup water)
         (contained-liquid cs-leaky)
         (container-of cs-leaky leaky-cup)
         (substance-of cs-leaky water)
         (can-contain sink5 water)
         (contained-liquid cs-sink5)
         (container-of cs-sink5 sink5)
         (substance-of cs-sink5 water)
         (substance water)
         (Liquid water)
         (Hole-of leaky-cup hole7)
         (Fluid-Path hole7)
         (Physical-Path leaky-cup sink5 hole7)))

(defBSegment liquid-draining-behavior
  Characterizations ((matter-movement ?self)
                     (continuous-movement ?self)
                     (monotonic ?self))
  Components (liquid-draining-bseg liquid-drained-bseg)
  Relations ((meets liquid-draining-bseg liquid-drained-bseg)))

(defSituation liquid-draining-bseg
  Characterizations ((matter-movement ?self)
                     (continuous-movement ?self)
                     (monotonic ?self))
  Processes ((liquid-flow pi1 ((?subst . water) (?source . leaky-cup) (?src-cs . cs-leaky)
                               (?destination . sink5) (?dst-cs . cs-sink5) (?path . hole7)))
             (liquid-drain pi2 ((?sink . sink5) (?sink-cs . cs-sink5) (?lf . pi1))))
  Individuals (leaky-cup cs-leaky water sink5 cs-sink5 hole7)
  Dynamics ((Decreasing (Amount-of cs-leaky))
            (Greater-than (A (Amount-of cs-leaky)) zero)))

(defSituation liquid-drained-bseg
  Individuals (cs-leaky)
  Dynamics ((Constant (Amount-of cs-leaky))
            (Equal-to (A (Amount-of cs-leaky)) zero)))

;;; Boaing 

(n«:d«f»tit7 wat«r) 
(sM:d«f«ntit7 Mg^^alit^) 
(MM:d«f«ntit7 illini-watw) 
(sM:d«f entity illixii-it«ui) 
(•M:d«x«ntit7 wMtiaghom^) 

(dtfObiwration boiling-obi 
BthaTior boiling-b«haTior 

XndlTidiiali (w«itiBghoiu« aagnalitt illini-vatw illini-staafli) 
Vorld (dtOT^ w«iti]ighoiu«) 
(pan ugnalita) 

(containad-liqaid illini-wattr) 
(■ttbitanca-of illi&i-*watw watw) 
(containtr-of illini-watar ugnalita) 
(atatt-of illini-vatar liquid) 
(co&tainad-gaa illini-itaaa) 
(aiibatanea-of illini-itaaB vatar) 
(containar-of illini-itaaa Mgnalita) 
(atata-of illini-ataaB gas) 
(can-eontaia ugnalita vatar) 
(thanal-objact waatiaghoiiaa) 
(thanal-objaet iUiai-watar) 
(baat-aoiirca waatiaghooaa) 

(phyaieal-path waatiaghoiiaa illini-vatar aagnalita) 
(baat-path aagnalita))) 

(d^fBSapiant boiliag-babaTior 
Cbaraetarizationa ((fluidie-phaaa-changa-BOTa ?aalf) 

(continuom-BOTaBant ?aalf) 

(■onotonie ?aalf)) 
Coi^nanta (boiliag-baag boiliag-dry) 
Ralationa ((aaata boiliag-baag boiling-dry))) 

(defSituation boiling-bseg
  Characterizations ((fluidic-phase-change-move ?self)
                     (continuous-movement ?self)
                     (monotonic ?self))
  Processes ((heat-flow pi1 ((?source . westinghouse) (?destination . illini-water)
                             (?path . magnalite)))
             (boiling pi2 ((?container . magnalite) (?subst . water) (?cl . illini-water)
                           (?cg . illini-steam) (?hf . pi1)))
             (heat-replenish pi3 ((?source . westinghouse) (?hf . pi1))))
  Individuals (water westinghouse magnalite illini-water illini-steam)
  Dynamics ((Constant (Temperature westinghouse))
            (Constant (Temperature illini-water))
            (Equal-to (A (Temperature westinghouse)) (A (Temperature illini-water)))
            (Greater-than (A (Amount-of illini-water)) zero)
            (Decreasing (Amount-of illini-water))
            (Increasing (Amount-of illini-steam))))

(defSituation boiling-dry
  Individuals (illini-water)
  Dynamics ((Constant (Amount-of illini-water))
            (Equal-to (A (Amount-of illini-water)) zero)
            (Greater-than (A (Amount-of illini-steam)) zero)))

;;; Dissolving

(sme:defEntity water)
(sme:defEntity water1)
(sme:defEntity glass4)
(sme:defEntity salt1)

(defObservation dissolving
  Behavior dissolve-behavior
  Individuals (water1 water salt1 glass4)
  World ((contained-liquid water1)
         (substance-of water1 water)
         (container-of water1 glass4)
         (state-of water1 liquid)
         (solid salt1)
         (soluble salt1)
         (soluble-in salt1 water1)
         (solution water1)
         (immersed-in salt1 water1)))

(defBSegment dissolve-behavior
  Characterizations ((solid-phase-change-move ?self)
                     (continuous-movement ?self)
                     (monotonic ?self))
  Components (dissolving dissolve-stopped)
  Relations ((meets dissolving dissolve-stopped)))

(defSituation dissolving
  Characterizations ((solid-phase-change-move ?self)
                     (continuous-movement ?self)
                     (monotonic ?self))
  Processes ((dissolve pi1 ((?solute . salt1) (?solution . water1)))
             (solution ((?solution . water1))))
  Dynamics ((Greater-than (A (Amount-of salt1)) zero)
            (Decreasing (Amount-of salt1))
            (Increasing (Concentration water1))))

(defSituation dissolve-stopped
  Individuals (water glass4 water1 salt1)
  Processes ((solution ((?solution . water1))))
  Dynamics ((Constant (Amount-of salt1))
            (Constant (Concentration water1))
            (Equal-to (A (Amount-of salt1)) zero)))

;;; Spring oscillating

(sme:defEntity sm-sys)
(sme:defEntity spring5)
(sme:defEntity block5)
(sme:defEntity mechanical :constant? T)

(defObservation spring-mass-oscillator
  Behavior spring-mass-oscillating
  Individuals (sm-sys spring5 block5)
  World ((Connected spring5 block5)
         (Spring-Mass-System sm-sys spring5 block5)
         (Spring spring5)
         (Block block5)
         (Compressing-Object string1)
         (Translating-object block5)))

(defBSegment spring-mass-oscillating
  Characterizations (;(matter-movement ?self)
                     ;(corpuscular-movement ?self)
                     (sinusoidal ?self))
  Components (spring-mass-s1 spring-mass-s2 spring-mass-s3 spring-mass-s4
              spring-mass-s5 spring-mass-s6 spring-mass-s7 spring-mass-s8)
  Relations ((meets spring-mass-s1 spring-mass-s2)
             (meets spring-mass-s2 spring-mass-s3)
             (meets spring-mass-s3 spring-mass-s4)
             (meets spring-mass-s4 spring-mass-s5)
             (meets spring-mass-s5 spring-mass-s6)
             (meets spring-mass-s6 spring-mass-s7)
             (meets spring-mass-s7 spring-mass-s8)
             (meets spring-mass-s8 spring-mass-s1)))

0— [<] — 

(defSituation spring-mass-s1
  Processes ((Spring ((?spring . spring5)))
             (Translating-Object ((?obj . block5)))
             (Applied-Force vi0 ((?src . spring5) (?dst . block5)
                                 (?force . restorative-force) (?acc . acceleration)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Decreasing (Displacement spring5))
            (Decreasing (Position block5))
            (Decreasing (Velocity block5))
            (Less-than (A (Velocity block5)) ZERO)
            (Greater-Than (A (Position block5)) zero)
            (Greater-Than (A (Displacement spring5)) zero)))

— C<o] — 

(defSituation spring-mass-s2
  Processes ((Spring ((?spring . spring5)))
             (Translating-Object ((?obj . block5)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Decreasing (Displacement spring5))
            (Decreasing (Position block5))
            (Constant (Velocity block5))
            (Less-than (A (Velocity block5)) ZERO)
            (Equal-To (A (Position block5)) ZERO)
            (Equal-To (A (Displacement spring5)) ZERO)))

III W 

(defSituation spring-mass-s3
  Processes ((Spring ((?spring . spring5)))
             (Translating-Object ((?obj . block5)))
             (Applied-Force vi0 ((?src . spring5) (?dst . block5)
                                 (?force . restorative-force) (?acc . acceleration)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Decreasing (Displacement spring5))
            (Decreasing (Position block5))
            (Increasing (Velocity block5))
            (Less-than (A (Velocity block5)) ZERO)
            (Less-Than (A (Position block5)) zero)
            (Less-Than (A (Displacement spring5)) zero)))

;;; □ — o — 
(defSituation spring-mass-s4
  Processes ((Spring ((?spring . spring5)))
             (Applied-Force vi0 ((?src . spring5) (?dst . block5)
                                 (?force . restorative-force) (?acc . acceleration)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Constant (Displacement spring5))
            (Constant (Position block5))
            (Increasing (Velocity block5))
            (Equal-To (A (Velocity block5)) ZERO)
            (Less-Than (A (Position block5)) zero)
            (Less-Than (A (Displacement spring5)) zero)))



;;; — C>]~o — 
(defSituation spring-mass-s5
  Processes ((Spring ((?spring . spring5)))
             (Translating-Object ((?obj . block5)))
             (Applied-Force vi0 ((?src . spring5) (?dst . block5)
                                 (?force . restorative-force) (?acc . acceleration)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Increasing (Displacement spring5))
            (Increasing (Position block5))
            (Increasing (Velocity block5))
            (Greater-than (A (Velocity block5)) ZERO)
            (Less-Than (A (Position block5)) zero)
            (Less-Than (A (Displacement spring5)) zero)))



;;; — to>] 

(defSituation spring-mass-s6
  Processes ((Spring ((?spring . spring5)))
             (Translating-Object ((?obj . block5)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Increasing (Displacement spring5))
            (Increasing (Position block5))
            (Constant (Velocity block5))
            (Greater-than (A (Velocity block5)) ZERO)
            (Equal-To (A (Position block5)) zero)
            (Equal-To (A (Displacement spring5)) zero)))

-0— [>] 

(defSituation spring-mass-s7
  Processes ((Spring ((?spring . spring5)))
             (Translating-Object ((?obj . block5)))
             (Applied-Force vi0 ((?src . spring5) (?dst . block5)
                                 (?force . restorative-force) (?acc . acceleration)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Increasing (Displacement spring5))
            (Increasing (Position block5))
            (Decreasing (Velocity block5))
            (Greater-than (A (Velocity block5)) ZERO)
            (Greater-Than (A (Position block5)) zero)
            (Greater-Than (A (Displacement spring5)) zero)))

;;; — 0 □ 

(defSituation spring-mass-s8
  Processes ((Spring ((?spring . spring5)))
             (Applied-Force vi0 ((?src . spring5) (?dst . block5)
                                 (?force . restorative-force) (?acc . acceleration)))
             (Spring-Mass-System ((?system . sm-sys) (?spring . spring5) (?mass . block5))))
  Individuals (spring5 block5)
  Dynamics ((Constant (Displacement spring5))
            (Constant (Position block5))
            (Decreasing (Velocity block5))
            (Equal-To (A (Velocity block5)) ZERO)
            (Greater-Than (A (Position block5)) zero)
            (Greater-Than (A (Displacement spring5)) zero)))

;;; Airplane wing

(sme:defEntity wing9 :type physob)
(sme:defEntity air-stream9 :type physob)

(defObservation air-plane-wing
  Behavior plane-wing-lift
  Individuals (wing9 air-stream9)
  World ((Touching air-stream9 (Tside wing9))
         (Touching air-stream9 (Bside wing9))
         (Flowing-Air air-stream9)
         (Physob wing9)
         (Enveloped wing9 air-stream9)
         (object-part-of (Tside wing9) wing9)
         (object-part-of (Bside wing9) wing9)
         (Opposite-sides (Tside wing9) (Bside wing9))
         (derivative-of (position wing9) (velocity wing9))
         (Equal-to (A (Curvature (Bside wing9))) (A (Curvature (Tside wing9))))))

(defSituation plane-wing-lift
  Characterizations ((fluid-flow ?self))
  Processes ((air-plane-wing ((?wing . wing9)))
             (air-contact vi0 ((?air . air-stream9) (?obj . wing9)
                               (?pos-part . (Tside wing9)) (?neg-part . (Bside wing9))))
             (air-plane-lift vi1 ((?air . air-stream9) (?obj . wing9) (?part . (Tside wing9))))
             (air-plane-lift vi2 ((?air . air-stream9) (?obj . wing9) (?part . (Bside wing9))))
             (applied-force-pos pi0 ((?obj . wing9) (?part . (Bside wing9))
                                     (?src . air-stream9)))
             (applied-force-neg pi1 ((?obj . wing9) (?part . (Tside wing9))
                                     (?src . air-stream9))))
  Individuals (air-stream9 wing9)
  Dynamics ((Constant (Position wing9))
            (Constant (Velocity wing9))))
Appendix E
Language Declarations 



All predicates must be declared to SME prior to use. Each declaration defines the predicate's
arity, a name and type for each argument, and the next most general type to which the
predicate maps. Predicates may additionally be declared commutative, in which case the order
of arguments is unimportant when matching, and/or n-ary, in which case the predicate can
take any number of arguments. This section presents a complete listing of the language
declarations used for PHINEAS.
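
The role of these declaration fields can be illustrated with a small sketch. It is written in Python purely for illustration (PHINEAS and SME themselves are Lisp programs), and every name in it (`PredicateDecl`, `declare`, `arity_ok`, `generalizations`, `same_arguments`) is hypothetical rather than part of SME. It assumes only what the paragraph above states: a declaration records arity, argument names and types, an optional next most general type, and the commutative and n-ary flags.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass(frozen=True)
class PredicateDecl:
    """One declared predicate (hypothetical stand-in for an SME declaration)."""
    name: str
    args: Tuple[Tuple[str, str], ...]      # ((argument-name, argument-type), ...)
    kind: str                              # e.g. "relation", "function", "attribute"
    expression_type: Optional[str] = None  # next most general type, if any
    commutative: bool = False
    n_ary: bool = False

REGISTRY = {}

def declare(name, args, kind, expression_type=None, commutative=False, n_ary=False):
    """Record a declaration; names are stored in lower case."""
    parent = expression_type.lower() if expression_type else None
    decl = PredicateDecl(name.lower(), tuple(args), kind, parent, commutative, n_ary)
    REGISTRY[decl.name] = decl
    return decl

def arity_ok(expr):
    """An n-ary predicate accepts any argument count; others must match exactly."""
    decl = REGISTRY[expr[0].lower()]
    return decl.n_ary or len(expr) - 1 == len(decl.args)

def generalizations(name):
    """Follow the 'next most general type' links upward from a predicate."""
    chain = []
    decl = REGISTRY.get(name.lower())
    while decl is not None and decl.expression_type is not None:
        chain.append(decl.expression_type)
        decl = REGISTRY.get(decl.expression_type)
    return chain

def same_arguments(expr_a, expr_b):
    """Compare two uses of one predicate, ignoring order when it is commutative."""
    if expr_a[0].lower() != expr_b[0].lower():
        return False
    decl = REGISTRY[expr_a[0].lower()]
    if decl.commutative:
        return sorted(expr_a[1:]) == sorted(expr_b[1:])
    return tuple(expr_a[1:]) == tuple(expr_b[1:])

# A few declarations mirroring entries in the listing below.
declare("Physical-Proximity", [("object", "physob"), ("object", "physob")],
        "relation", commutative=True)
declare("Touching", [("obj", "physob"), ("obj", "physob")], "relation",
        expression_type="physical-proximity", commutative=True)
declare("Equal-To", [("arg1", "linear"), ("arg2", "linear")], "relation",
        commutative=True)
declare("Greater-Than", [("arg1", "linear"), ("arg2", "linear")], "relation")
declare("and", [("conjunct", "sentence"), ("conjunct", "sentence")], "logical",
        commutative=True, n_ary=True)
```

With these declarations, `same_arguments(("Equal-To", "a", "b"), ("Equal-To", "b", "a"))` holds because Equal-To is commutative, whereas the same swap for Greater-Than does not. This is only a simplified reading of what "unimportant when matching" means; the actual SME matcher is considerably more involved.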

;;;; Standard entities 

(sme:defEntity zero :type number :constant? t)
(sme:defEntity atmosphere :type physob :constant? t)

;;;; Predicates 

;;; Time

(sme:defPredicate At ((object physob) (time time-interval)) function :expression-type slice)
(sme:defPredicate Situation ((time-token time-interval) (relations set)) relation
                  :expression-type :temporal)
(sme:defPredicate Bseg ((time-token time-interval)) Attribute)
(sme:defPredicate Meets ((before time) (after time)) Relation
                  :documentation "(before after): interval before ends at point where interval after begins.")
(sme:defPredicate Summary-of ((parent-time time) (set set)) relation)

;;; Derivatives

(sme:defPredicate Decreasing ((arg linear)) Relation)
(sme:defPredicate Increasing ((arg linear)) Relation)
(sme:defPredicate Constant ((arg linear)) Relation)

;;; Change descriptors

(sme:defPredicate Exponential-Decay ((Q quantity)) Relation)
(sme:defPredicate Exponential-Growth ((Q quantity)) Relation)
(sme:defPredicate Exponential-Approach ((Q quantity)) Relation)
(sme:defPredicate Exponential-Approach-Dual ((Q-decreasing quantity) (Q-increasing quantity))
                  relation)

;;; Inequality information

(sme:defPredicate Equal-To ((arg1 linear) (arg2 linear)) relation :commutative? t)
(sme:defPredicate Greater-Than ((arg1 linear) (arg2 linear)) relation
                  :equivalent ((less-than arg2 arg1)))
(sme:defPredicate Less-Than ((arg1 linear) (arg2 linear)) relation
                  :equivalent ((greater-than arg2 arg1)))
(sme:defPredicate Unrelated ((arg1 linear) (arg2 linear)) relation :commutative? t)

;;; Quantities

;;; The defQtype form is a PHINEAS routine which invokes defPredicate
;;; as well as identifying it as a quantity for PHINEAS internal use.

(defQtype Change-Rate ((obj physob)) function :expression-type rate)
(defQtype Amount-of ((obj physob)) function :expression-type extensive-quantity)
(defQtype Amount-of-in ((sub physob) (state state) (container physob)) function
          :expression-type extensive-quantity)
(defQtype Pressure ((obj physob)) function :expression-type intensive-quantity)
(defQtype Volume ((obj physob)) function :expression-type extensive-quantity)
(defQtype Surface-Area ((obj physob)) function :expression-type extensive-quantity)
(defQtype Contact-Surface ((obj1 physob) (obj2 physob)) function
          :expression-type extensive-quantity)

(defQtype Pressure-in ((object physob)) function :expression-type intensive-quantity)
(defQtype Temperature-in ((object physob)) function :expression-type intensive-quantity)
(defQtype Heat (physob) function :expression-type extensive-quantity)
(defQtype Temperature ((obj physob)) function :expression-type intensive-quantity)
(defQtype Tboil (physob) function :expression-type temperature)

(defQtype Mass ((obj physob)) function :expression-type extensive-quantity)
(defQtype Moment-of-Inertia ((obj physob)) function :expression-type mass)

(defQtype Position ((obj physob)) function :expression-type distance :spatial :linear)
(defQtype Angle ((obj physob)) function :expression-type distance :spatial :angular)
(defQtype Displacement ((obj physob)) function :expression-type distance :spatial :linear)
(defQtype Angular-Displacement ((obj physob)) function :expression-type distance
          :spatial :angular)

(defQtype Velocity ((obj physob)) function :expression-type rate :spatial :linear)
(defQtype Angular-Velocity ((obj physob)) function :expression-type velocity
          :spatial :angular)

(defQtype Acceleration ((obj physob)) function :expression-type rate :spatial :linear)
(defQtype Angular-Acceleration ((obj physob)) function :expression-type acceleration
          :spatial :angular)
(defQtype Compliance ((obj physob)) function :expression-type extensive-quantity)
(defQtype Concentration ((obj liquid)) function :expression-type intensive-quantity)
(defQtype Saturation-Point ((obj liquid)) function :expression-type concentration)

(defQtype Force-Restoring ((obj physob)) function :expression-type force)
(defQtype External-force ((obj physob)) function :expression-type force)
(defQtype Restorative-force ((obj physob)) function :expression-type force)
(defQtype Torque ((obj physob)) function :expression-type force)

(defQtype Energy ((obj physob)) function :expression-type extensive-quantity)
(defQtype Kinetic-Energy ((obj physob)) function :expression-type energy)
(defQtype Total-Energy ((obj physob)) function :expression-type energy)
(defQtype Potential-Energy ((obj physob)) function :expression-type energy)

(defQtype Length ((obj physob)) function :expression-type distance :spatial :linear)
(defQtype Rest-Length ((obj physob)) function :expression-type length :spatial :linear)

(defQtype Charge ((obj physob)) function :expression-type extensive-quantity)
(defQtype Current ((obj physob)) function :expression-type rate)
(defQtype Voltage ((obj physob)) function :expression-type extensive-quantity)

(sme:defPredicate Quantity ((obj physob)) function)
(sme:defPredicate Rate ((Q quantity)) function :expression-type quantity)
(sme:defPredicate Intensive-Quantity ((quantity quantity)) function :expression-type quantity)
(sme:defPredicate Extensive-Quantity ((quantity quantity)) function :expression-type quantity)
(sme:defPredicate Linear ((num linear)) function)

(sme:defPredicate Number ((Q quantity)) function :expression-type linear)
(sme:defPredicate Ordinal ((Q ordinal)) function :expression-type linear)

;;; Miscellaneous movements

(sme:defPredicate Moving-Object ((obj physob)) relation)
(sme:defPredicate Translating-Object ((obj physob)) relation :expression-type moving-object)
(sme:defPredicate Rotating-Object ((obj physob)) relation :expression-type moving-object)

(sme:defPredicate Deformed-Object ((obj physob)) relation)
(sme:defPredicate Twisting-Object ((obj physob)) relation :expression-type deformed-object)
(sme:defPredicate Compressing-Object ((obj physob)) relation :expression-type deformed-object)

;;; QP syntax

(sme:defPredicate A ((Q quantity)) function :expression-type number)
(sme:defPredicate D ((Q quantity)) function :expression-type number)
(sme:defPredicate Q= ((quantity quantity) (exp math-expression)) relation)
(sme:defPredicate Qprop ((Q1 quantity) (Q2 quantity)) relation)
(sme:defPredicate Qprop- ((Q1 quantity) (Q2 quantity)) relation)
(sme:defPredicate I- ((Influenced quantity) (Influencer number)) relation)
(sme:defPredicate I+ ((Influenced quantity) (Influencer number)) relation)
(sme:defPredicate CTrans ((source quantity) (dest quantity) (rate quantity)) relation)
(sme:defPredicate Exists ((obj physob)) attribute)

(sme:defPredicate - ((Q quantity) (form expression)) relation :n-ary? T)
(sme:defPredicate + ((Q quantity) (form expression)) relation)
(sme:defPredicate * ((Q1 quantity) (Q2 quantity)) relation)
(sme:defPredicate Process-Definition ((p-name process) (PI process-instance)
                                      (consequences sentence)) relation)
(sme:defPredicate Packet-Definition ((p-name packet) (consequences sentence)) relation)
(sme:defPredicate View-Definition (sentence) relation)
(sme:defPredicate Process (entity) attribute)

(sme:defPredicate Individual ((individual item) (conditions expression)) relation)
(sme:defPredicate Conditions ((condition expression)) relation :commutative? t :n-ary? t)
(sme:defPredicate Has-Quantity (quantity entity) attribute)

(sme:defPredicate Process-instance-of ((process-type process) (process-token pid)) relation)
(sme:defPredicate Active ((process process-instance)) attribute :expression-type sentence)
(sme:defPredicate Correspondence ((pair1 pair) (pair2 pair)) relation)

;;; Logic

(sme:defPredicate not (sentence) relation)

(sme:defPredicate Implies ((antecedent predicate) (consequent predicate)) relation
                  :expression-type :implicational)
(sme:defPredicate Cause ((antecedent event) (consequent event)) relation
                  :expression-type :implicational)
(sme:defPredicate Supports ((antecedent predicate) (consequent predicate)) relation
                  :expression-type :implicational)
(sme:defPredicate or ((disjunct sentence) (disjunct sentence)) logical
                  :n-ary? T :commutative? T :relgroup? T)
(sme:defPredicate and ((conjunct sentence) (conjunct sentence)) logical
                  :n-ary? T :commutative? T :relgroup? T)

;;; Process quantities

(defQtype Vaporization-Rate ((obj physob)) function :expression-type rate)
(defQtype Heat-Flow-Rate ((obj physob)) function :expression-type rate)
(defQtype Flow-Rate ((obj physob)) function :expression-type rate)
(defQtype Restorative ((obj physob)) function :expression-type number)
(defQtype Replenish-Rate ((obj physob)) function :expression-type rate)
(defQtype Absorption ((obj physob)) function :expression-type number)
(defQtype Dissolve-Rate ((obj physob)) function :expression-type number)

;;; Objects

(sme:defPredicate Physob ((obj physob)) attribute)

(sme:defPredicate Spring ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Block ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Inductor ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Capacitor ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Container ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Pan ((obj physob)) attribute :expression-type container)
(sme:defPredicate Pipe ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Stove ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Beaker ((obj physob)) attribute :expression-type physob)
(sme:defPredicate Ball ((obj physob)) attribute :expression-type physob)
(sme:defPredicate String ((obj physob)) attribute :expression-type physob)

;;; Set notation 
(sme:defPredicate Set ((set-item physob)) relation :commutative? T :n-ary? T :relgroup? T)
(sme:defPredicate Pair ((obj1 physob) (obj2 physob)) relation :commutative? T)
(sme:defPredicate Ordered-Pair ((obj1 physob) (obj2 physob)) relation)

;;; Phase states

(sme:defPredicate Phase ((substance physob)) relation
                  :documentation "(substance): top-level node representing substance is in some phase state")
(sme:defPredicate Solid ((substance physob)) relation :expression-type phase)
(sme:defPredicate Liquid ((substance physob)) relation :expression-type phase)
(sme:defPredicate Gas ((substance physob)) relation :expression-type phase)

;;; Paths

(sme:defPredicate conduit ((path physob)) relation)

(sme:defPredicate fluid-path ((path physob)) relation :expression-type conduit)
(sme:defPredicate heat-path ((path physob)) relation :expression-type conduit)
(sme:defPredicate closed-path ((object physob) (end-points pair)) relation)

;;; Path alignment

(sme:defPredicate Aligned ((generic-path path)) attribute
                  :documentation "(generic-path): will currently allow its substance to pass through")
(sme:defPredicate Heat-Aligned ((heat-path heat-path)) attribute :expression-type aligned
                  :documentation "(heat-path): the specified object is currently a FUNCTIONAL heat path")
(sme:defPredicate Fluid-Aligned ((fluid-path fluid-path)) attribute :expression-type aligned
                  :documentation "(fluid-path): the specified object is currently a FUNCTIONAL fluid path")
(sme:defPredicate Gas-Aligned ((gas-path gas-path)) attribute :expression-type aligned
                  :documentation "(gas-path): the specified object is currently a FUNCTIONAL gas path")

;;; Spatial relationships 

(sme:defPredicate Physical-Proximity ((object physob) (object physob)) relation :commutative? T)
(sme:defPredicate Physical-Path ((obj1 physob) (obj2 physob) (path physob)) relation
                  :expression-type physical-proximity)

(sme:defPredicate Touching ((obj physob) (obj physob)) relation
                  :expression-type physical-proximity :commutative? T)

(sme:defPredicate Connected ((obj1 physob) (obj2 physob)) relation
                  :expression-type physical-proximity :commutative? T)
(sme:defPredicate Connection (physob physob physob) relation)

(sme:defPredicate Comm-Face ((obj1 solid) (obj2 solid)) function :expression-type solid)

;;; Containment 

(sme:defPredicate Containment ((containee physob) (container physob)) relation
                  :expression-type physical-proximity)
(sme:defPredicate Contained-in ((containee fluid) (container physob)) relation
                  :expression-type containment)
(sme:defPredicate Absorbed-in ((absorbee liquid) (absorber solid)) relation
                  :expression-type containment)
(sme:defPredicate Dispersed-in ((solute fluid) (solvent fluid)) relation
                  :expression-type containment)
(sme:defPredicate Immersed-in ((containee solid) (container liquid)) relation
                  :expression-type physical-proximity)
(sme:defPredicate Floating-in ((containee solid) (container liquid)) relation
                  :expression-type physical-proximity)

(sme:defPredicate Contained-Fluid ((cs entity) (sub substance) (can container)) relation)
(defCompound-Object Contained-Fluid)

(sme:defPredicate Contained-Stuff ((cs entity)) relation)
(defCompound-Object Contained-Stuff)

(sme:defPredicate Contained-Liquid ((cl liquid)) relation :expression-type contained-stuff)
(defCompound-Object Contained-Liquid)

(sme:defPredicate Contained-Gas ((cg gas)) relation :expression-type contained-stuff)
(defCompound-Object Contained-Gas)

(sme:defPredicate Atmosphere ((air gas)) relation :expression-type contained-gas)

(sme:defPredicate can-containment ((container entity) (sub entity)) relation)
(sme:defPredicate can-contain ((container entity) (sub entity)) relation
                  :expression-type can-containment)
(sme:defPredicate can-absorb ((absorber solid) (absorbee fluid)) relation
                  :expression-type can-containment)

(sme:defPredicate Solution ((solution liquid)) relation)
(sme:defPredicate Insoluble ((obj solid)) relation)
(sme:defPredicate Soluble ((obj solid)) relation)

(sme:defPredicate Soluble-in ((solute solid) (solvent liquid)) relation)
;;; Miscellaneous

(sme:defPredicate Active-Source (physob) attribute :expression-type boolean)
(sme:defPredicate Function-Of ((q1 linear) (q2 linear)) Relation)
(sme:defPredicate state ((st inanimate)) attribute)
(sme:defPredicate substance ((sub inanimate)) attribute)

(sme:defPredicate water ((substance physob)) attribute :expression-type physob)

(sme:defPredicate holds-liquid ((obj physob)) relation)
(sme:defPredicate container-of ((cs entity) (container entity)) relation)
(sme:defPredicate substance-of ((cs entity) (substance substance)) relation)
(sme:defPredicate state-of ((cs entity) (state state)) relation)

(sme:defPredicate Volume-Solid ((obj physob)) relation)
(sme:defPredicate Thermal-object ((obj physob)) relation)

(sme:defPredicate open ((can container)) relation)
(sme:defPredicate closed-container ((can container)) relation)
(sme:defPredicate hole-of ((can container) (hole physob)) relation)

(sme:defPredicate b-explains ((theories (set theories)) (situation time-interval)) relation
                  :documentation "(theories situation): theories explain, via b-analogy, the observation.")
(sme:defPredicate flow-process (pid) relation)
(sme:defPredicate liquid-sink ((obj physob)) attribute)
(sme:defPredicate heat-source (entity) attribute)
(sme:defPredicate source (pid entity) relation)

(sme:defPredicate destination ((pid process-instance) (entity physob)) relation)

(sme:defPredicate isa (entity entity) relation)

(sme:defPredicate force ((from physob) (to physob)) relation)

(sme:defPredicate force-application ((from quantity) (to quantity)) relation)

(sme:defPredicate force-type (force-name) relation)

(sme:defPredicate dt ((Q quantity) (Derivative quantity)) relation)

(sme:defPredicate derivative-of ((Q quantity) (Derivative quantity)) relation)

(sme:defPredicate Spring-Mass-System ((system entity) (spring spring) (block physob)) relation
                  :expression-type object-system)
(defCompound-Object Spring-Mass-System)

(sme:defPredicate object-system (entity entity entity) relation)

(sme:defPredicate connected-to-spring ((mass physob) (spring spring)) relation)
(sme:defPredicate sinusoidal ((Q quantity)) relation)

(sme:defPredicate 90-degree-delay ((referent quantity) (delayed quantity)) relation)
(sme:defPredicate mobile ((obj physob)) relation)

;;; Obscure QPE needs

(sme:defPredicate PI0 ((arg argument) (value argument)) relation)
(sme:defPredicate PI1 ((arg argument) (value argument)) relation)
(sme:defPredicate PI2 ((arg argument) (value argument)) relation)
(sme:defEntity source :constant? T :type :process-arg)
(sme:defEntity destination :constant? T :type :process-arg)